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(57) Abstract 

The identification and characterisation of human and rat Islet-Brain I (IBl) is disclosed, a transcriptional acdvator that is involved in 
the control of the OLUH and insulin genes by inteiacting with homologous cis^gulatory elements of the GLUT2 and insulin promoters. 
The rat IBl cDNA encodes a 714 amino acid protein and the human IBl cDNA a 71 1 amino acid protein. The use of IBl polypeptides, 
nucleic acid, agonists and antagonists in the treatment or diagnosis of diabetes, neurological diseases such as dementia and/or parkinsonism, 
the inhibition/promotion of apoptosis and cancer is disclosed. 
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Transcription Factor Islet -Brain 1 (IBl) 

Field of the Invention 

The present invention relates to identification and 

5 characterisation of Islet-Brain 1 (IBl) , a transcriptional 

activator that is involved in the control of the GLUT2 and 
insulin genes by interacting with homologous cis- regulatory 
elements of the GLUT2 and insulin promoters, and to 
materials and methods deriving from this work. In 

0 particular, the present invention relates to the uses of 

IBl nucleic acid, IBl polypeptides, IBl antagonists and 
antibodies in the diagnosis, and prophylactic and 
therapeutic treatment of conditions such as diabetes, 
neurological diseases such as dementia and/or parkinsonism, 

5 cancer and in the promotion or inhibition of apoptosis. 

Background of the Invention 

The GLUT2 facilitated glucose transporter isoform is 
a membrane protein present in the pancreatic iS-insulin- 

0 secreting cells, the basolateral membrane of intestinal and 

kidney absorptive cells, in hepatocytes and in a subset of 
neurons (21,31,44) . In these cells, GLUT2 catalyzes the 
transepithelial transport of glucose. In pancreatic 
islets, GLUT2 allows a rapid equilibration of glucose 

5 between the extracellular space and the interior of the 

cells and it may play a crucial role in the glucose 
signalling mechanism leading to insulin secretion (43) . 
However, the relative importance of GLUT2 in the sensing of 
the /S-pancreatic cells to glucose remains debated. In 

0 human ^-cells, the level of expression of GLUT2 is low and 

the intracellular glucokinase activity seems to be the 
rate-limiting step in the glycolytic pathway (5,11). On 
the other hand, insulinoma cells that had lost their normal 
glucose responsiveness have low GLUT2 content, but some 

5 glucose sensitivity may be recovered after reintroducing 

GLUT2 expression through stable transfection of these cells 
(10,16). Furthermore, transgenic mice that express GLUT2 
antisense RN?^ driven by the insulin promoter led to an 8 0% 
reduction in GLUT2 which was paralleled by a decreased 
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glucose- induced insulin secretory response and by the onset 
of diabetes (48) . These observations are critical since 
several experimental models of diabetes have shown that 
GLUT2 expression is dramatically reduced specifically in 
the pancreatic ^-cells, and that this mechanism could 
participate to the onset of the disease (18,29,30,32,45- 
47) . Therefore, while GLUT2 levels are unchanged or even 
upregulated in several tissues such as the liver and the 
intestine during the hyperglycemic conditions observed in 
diabetes, the same gene undergoes a drastic dysregulation 
only in the pancreatic islets. 

A fragment of the murine GLUT2 promoter has been 
cloned and shown to be glucose-responsive when transfected 
into differentiated insulin-producing cells or into 
hepatocytes (35,36,52). Important cis- regulatory sequences 
were identified within this promoter region including a 
functionally responsive PDX-1 element, a cyclic AMP 
responsive element, and three cis elements termed GTI, GTII 
and GTIII (3,36,53). The presence of GTI, II and III are 
both sufficient and necessary to confer pancreatic- specific 
expression to a reporter gene in vitro or in vivo, using a 
transgenic mice approach (3,51). GTI and GTIII have been 
previously shown to bind distinct, but ubiquitously 
expressed trans-acting factors. 

Summary of the Invention 

The present invention is based on successful 
expression cloning of a transcription factor that binds to 
the GTII element of the GLUT2 and insulin genes from a 
0 differentiated insulin-secreting cDNA library. In part, 

the success of this exercise was based on the inventors' 
realisation of the importance of GTII and the library they 
constructed to find the IBl gene. The IBl polypeptides 
described herein are relatively large, and cloning them was 
5 achieved by the construction by the inventors of a high 

quality cDNA library for expression cloning. 

This factor is abundantly expressed in the pancreatic 
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islets and in the brain and has been named IB-1 for Islet - 
Brain 1. Both human and rat IBl genes and polypeptides 
have been obtained. The rat IBl cDNA encodes a 714 amino 
acid protein and the human IBl cDNA a 711 amino acid 
5 protein. The cDNAs encoding the rat and human IBl 
polypeptides have a 94% sequence identity and the 
polypeptides have a 97% amino acid sequence identity (see 
the alignment of the sequences in figure ID) , and have a 
proline-rich region and a putative basic helix- loop-helix 
10 domain (bHLH) . The IBl gene is highly expressed in the 
pancreatic islets and in the brain and to a much lesser 
extent in the heart and the kidney. In the Langerhans 
islets and in )S-cell lines, these transcripts are 
translated into immunodetectable cytoplasmic and nuclear 
15 protein. When tested in vitro, IBl bound specifically to 

the GTII cis element of the GLUT2 gene and to an homologous 
regulatory sequence of the insulin promoter termed RIPE3 . 
This rat insulin promoter element 3 (RIPE3) is an important 
enhancer sequence sufficient to confer jS-cell specific 
20 expression to the insulin gene. Functionally, IBl 

transactivated the proximal region of the GLUT2 promoter 
linked to a luciferase reporter gene and was also a potent 
activator of the insulin gene. This effect is mediated 
through the RIPE3 sequence as demonstrated by the 
25 observation that multiple copies of this enhancer sequence 

cloned 5' of an heterologous promoter was transactivated by 
an ejqpression vector encoding IBl in transient transfection 
studies. IBl appears to function only in insulin-secreting 
cells as no trans activation was observed in non-pancreatic 
30 or in glucagon-producing cell lines. These data 
demonstrate the presence of a novel transcriptional 
activator abundantly expressed in the endocrine pancreas 
and which participates to the proper jS-cell specific 
control of the GLUT2 and the insulin genes through 
35 homologous sequences present in both promoters. 

The nucleic acid and amino acid sequences of rat IBl 
are shown in figure lA. The human IBl cDNA is shown in 
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figxire IE, with the translated amino acid sequence shown in 
figure IF. The human IBl gene is located on chromosome 11 
at llpll.l2 on the LDB cytogenic map. The IBl gene is 
adjacent to markers D11S134 and D11S3979. 
5 The human cDNA was constructed as RNA using tissue 

obtained from a surgically removed human insulinoma. Poly 
A"" RNA was extracted and a cDNA library constructed and 
subsequently screened with a radiolabelled rat IBl cDNA 
probe. This allowed the inventors to isolate the human 

10 cDNA encoding IBl. This cDNA was then used as a probe to 

clone the human IBl gene from a bacterial artificial 
chromosome (BAG) . Several clones were obtained and part of 
them sequenced. The above protocol was then used to 
complete the sequencing of the human IBl nucleic acid shown 

15 in figure IE. 

The human IBl gene is multiexonic and is located in 
chromosome llpll.l2. The chromosomal mapping was obtained 
by FISH experiments and PGR of hybrid cells (hamster-human) 
using as a probe the multiexonic IBl gene. IBl is 

2 0 expressed in the brain of rat, mouse and human species and 

in tissues with a high degree of similarity such as the 
endocrine pancreas (many neuronal features are present in 
the insulin /S-cells) . 

In a first aspect, the present invention provides a 
25 substance which is an isolated polypeptide comprising a 

polypeptide having the amino acid sequence set out in 
figure lA or figure IF. 

In a further aspect, the present invention provides a 
substance which is an isolated polypeptide having greater 

3 0 than 80% amino acid sequence identity with the amino acid 

sequence set out in figure lA or IF. 

In a further aspect, the present invention provides a 
substance which is a polypeptide which is a mutant, 
variant, derivative or allele of any one of the above 
35 polypeptides. 

In a further aspect, the present invention provides a 
substance which is a fragment of a polypeptide having the 
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amino acid sequence set out in figure lA or IF which 
exhibits a biological property of full length IBl protein. 
In one embodiment, the fragment includes the domain from 
amino acide 566-612 of the sequence shown in figure lA or 
5 the domain from amino acids 563-609 of the sequence shown 

in figure IF, or an active portion of that domain. In an 
alternative embodiment, the present invention provides a 
polypeptide which is a protein interaction domain having 
the sequence shown in figure lA from amino acids 56 6-612 

10 and in figure IE from amino acids 563-609. As this domain 

is believed to be responsible for some of the interactions 
between IBl and other polypeptides, it can be used in. 
methods of screening for binding partners, e.g. peptides 
which could act as inhibitors of IBl . 

15 In a further aspect, the present invention provides 

isolated nucleic acid molecules encoding any one of the 
above polypeptides. Examples of such nucleic acid 
sequences are the nucleic acid sequences set out in figures 
lA and IE. The present invention also include nucleic 

20 molecules having greater than a 90% sequence homology with 

the nucleic acid sequence of figure lA or IE. 

In further aspects, the present invention provides an 
expression vector comprising the above IBl nucleic acid 
operably linked to control sequences to direct its 

25 expression, and host cells transformed with the vectors. 

The present invention also includes a method of producing 
IBl polypeptides comprising culturing the host cells and 
isolating the IBl polypeptide thus produced. 

In a further aspect, the present invention provides an 

30 expression vector comprising IBl nucleic acid for use in 

methods of gene therapy, e.g. in the treatment of patients 
unable to produce sufficient IBl or to engineer cell lines 
capable of producing IBl. 

In a further aspect, the present invention provides a 

35 cell line for transplantation into a patient, the cell line 

being transformed with nucleic acid encoding an IBl 
protein, and being capable of producing IBl polypeptide. 
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The expression of IBl in a transformed cell line can affect 
endogenous genes such as the insulin or GLUT2 genes. In 
one embodiment, the cell lines can be encapsulated, e.g. in 
a biocompatible polymer so that the IBl produced by the 
5 cells line can be secreted into the patient, while 

preventing rejection by the immune system of the host. 
Methods for encapsulating cells in biocompatible polymers 
are described in W093/16687 and W096/31199. 

In a further aspect, the present invention provides a 
10 pharmaceutical composition comprising an IBl nucleic acid 

molecule. 

In a further aspect, the present invention provides a 
pharmaceutical composition comprising one or more IBl 
polypeptides as defined above. 
15 In further aspects, the present invention provides the 

above IBl polypeptides and nucleic acid molecules for use 
in methods of medical treatment. The present invention 
further provides the use of the IBl polypeptides in the 
preparation of medicament for activating the GLUT2 or 
20 insulin promoters leading to the production of GLUT2 or 

insulin. Preferably, the activation takes place in a cell 
specific manner, e.g. in jS-cells. This could be used in 
the treatment of conditions treatable using insulin or 
GLUT2, such as diabetes. Since IBl is also present in 
25 muscle tissue such as the heart it may function by 

modulating insulin sensitivity and ameliorating abnormal 
glucose disposal in diabetic patients. 

IBl could also be used as an agent which maintains a 
state of differentiation within in a cell, i.e. acts as an 
30 anti-apoptotic agent. Thus, IBl can be used as an anti- 

neoplastic agent, e.g. as a drug to control or treat some 
cancers. As an example, insulinomas are human tumours 
which undergo dedif f erentiation and divide. Thus, IBl 
antagonists could be used to attack tumour cells, while IBl 
35 could be used to protect surrounding healthy tissue from 

the effects of treating tumour cells, using IBl antagonists 
and/or conventional radiotherapy or chemotherapy. Thus, 
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IBl could act as a differentiation agent to treat these 
cells. A further application of IBl antagonists is in the 
treatment of brain tumours, such as glioblastomas which are 
typically \intreatable using conventional medicine. 
5 IBl is similar to JIP-1, a cytoplasmic protein 

identified by Dickens et al, Science, 277:693-696, 1997. 
However, IBl differs from JIP-1 by the insertion of 4 7 
amino acids in the carboxy terminal portion of the protein, 
and has a 97% amino acid homology over the remaining 

10 sequence of rat IBl shown in figure lA (i.e. excluding the 

insert not present in JIP-1) . As overexpression of JIP-1 
in neuronal cells inhibits apoptosis of the cells, IBl 
could be used to suppress apoptosis in cells, e.g. stress - 
induced apoptosis induced in neurons. This stress 

15 activation can be caused by ultraviolet (uv) radiation, 

anoxia, hypoglycemia, cytokines such as IL-6, or by trauma. 
This in turn supports the present inventors' earlier 
suggestion to the use of IBl to prevent cell death that 
occurs in diseases such as dementia, neurodegenerative 

20 diseases, ischemia of the heart, myocardial infarction (IBl 

is present in heart), and post trauma, e.g. in sections of 
the spine in paraplegia and in neuronal trauma. 

In a further aspect, the present invention provides 
the use of the IBl polypeptides in screening candidate 

25 compounds for IBl biological activity, e.g. to find 

peptidyl or non-peptidyl mimetics of the IBl polypeptides 
to develop as lead compounds in pharmaceutical research. 

In a further aspect, the present invention provides 
antibodies capable of specifically binding to the above IBl 

30 polypeptides. These antibodies can be used in assays to 

detect and quantify the presence of IBl polypeptide, in 
methods of purifying IBl polypeptides, and in 
phannaceutical compositions, e.g. to neutralize IBl in 
conditions in which its overexpression has deleterious 

35 effect, or to inhibit the anti-apoptotic effect of IBl in 

diseased tissue such as tumours. Accordingly, antagonists 
such as antibodies which block the expression of IBl or 
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neutralize IBl in the tissues in which it is overexpressed 
can be used for treating such conditions and are included 
within the present invention. Polyclonal antibodies to . the 
N-terminal portion (residues 1-280) of the IBl protein of 
5 figure lA are exemplified below. 

In a further aspect, the present invention method for 
determining the presence of IBl nucleic acid and/or 
mutations within an IBl nucleic acid sequence in a test 
sample comprising detecting the hybridization of test 

10 sample nucleic acid to a nucleic acid probe based on the 

IBl nucleic acid sequences set out in figures lA or IE. 

In a further aspect, the present invention provides 
the use of IBl nucleic acid as defined above in the design 
of antisense oligonucleotides to restrict IBl expression in 

15 a population of cells, i.e. phosphorothiolated or 

chloresterol linked oligonucleotides which can facilitate 
internalization and stabilization of the oligonucleotides. 

In a further aspect, the present invention provides a 
method of amplifying a nucleic acid test sample comprising 

2 0 priming a nucleic acid polymerase reaction with nucleic 

acid encoding an IBl polypeptide as defined above. The 
present invention also provides the use of the above 
nucleic acid in the search for mutations in the IBl genes, 
e.g. using techniques such as single stranded conformation 

25 polymorphism (SSCP) . 

These and other aspects of the present invention are 
described in more detail below. 

By way of example, embodiments of the present 
invention will now be described in more detail with 

30 reference to the accompanying figures. 

Brief Description of the Fioures 
Figure 1 

IBl cDNA and predicted protein sequence . 
35 A. Nucleotide sequence of rat IBl cDNA and its predicted 

amino acid sequence. The nucleic acid residues are 
numbered from the nucleotide in the cDNA, and the amino 
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acid residues are numbered from the beginning of the long 
open reading frame. Computer assisted analysis of the 
protein sequence indicates the presence of highly 
helicoidal structure (residues 31-61 and 114-125) and a 
5 proline-rich region (residues 292-366) in amino- terminal 

part of the protein, a putative nuclear localization signal 
(underlined twice) as well as a putative DNA-binding domain 
and bHLH dimerization domain (underlined once) . 

B. Amino acid sequence comparison of IBl with other bHLH 
10 proteins. Amino acid sequence were aligned to maximize 

homology within the bHLH region. Shaded amino acids are 
conserved among bHLH proteins, 

C. A schematic diagram of the IBl protein with putative 
domains indicated. Note the 47 amino acid insertion (black 

15 box) which differentiates IBl from JIP-1. 

D. Comparison of the rat and human IBl translated amino 
acid sequences.il = identity, bold = 47 amino acid insert in 
IBl; • = no amino acid present. 

E. Complete nucleotide sequence of human IBl cDNA/ • = 
20 exon-intron junction. 

F. Complete amino acid sequence of human IBl polypeptide. 

Figure 2 

Tissue-specific expression of IBl gene. 

25 A. The distribution of IBl transcript in tissues and cell 

lines was detemnined by Northern blot analysis using 10 fig 
of total RNA prepared from INS-1 and RIN5F (two insulin- 
producing /S-cell lines) , from InRl-G9 (a glucagon-producing 
a-cell line) and several rat tissues. IBl transcripts of 

30 3,0 and 3.2 kb were detected only in the insulin-secreting 

cell lines and in the brain and to a lesser extent in the 
heart and kidney. The blot was stripped and rehybridized 
with a jS-actin probe (bottom) . 

B. Five micrograms of poly(A+) RNA prepared from 2 
35 different insulin-secreting cell lines, from rat liver, 

kidney and pancreas were similarly analyzed for IBl gene 
expression. IBl transcripts were detected in a cell- and 
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tissue-specific manner. 

C. A total of 5 fig of RNA obtained from isolated rat 
pancreatic islets incubated in 2 . 8 mM or 3 0 mM glucose for 
14 hours were analyzed by Northern blotting together with 

5 rat liver and adipose tissue RNAs, IBl is abundantly 

e^qjressed in the isolated islets and its expression is not 
regulated by glucose. 

D. IBl gene expression in the rat brain. A total of 10 
/ig RNA extracted from the cortex (lane 1) , pituitary gland 

10 (lane 2), hypothalamus (lane 3), cerebellum (lane 4) and 

medulla (lane 5) were separated in a formaldehyde gel and 
analyzed by Northern blot for IBl presence. The two IBl 
transcripts were detected at high abundance in the cortex 
and the hypothalamus regions. The same blot was 

15 subsequently rehybridized with jS-actin (bottom) . 

Figure 3 

The IBl protein is detected in adult rat tissues. 

A. Immunoblot analysis of IBl. A rabbit polyclonal 
20 antibody (or- IBl) was raised towards the N- terminal part of 

the recombinant protein (aa 1-280) and affinity purified. 
Western blot analysis of /8TC3 whole cell extracts with the 
Of- IBl antibody demonstrated the presence of a 120 kDa 
product which was undetected with the preimmune serum 
25 (CTRL) . 

B. Determination of the apparent molecular weight of IBl. 
In vitro translated IBl cDNA in the sense (S, T3 RNA- 
polymerase) or the antisense (AS, T7 RNA-polymerase) 
orientation in presence of ^^S labelled methionine was 

30 separated by SDS-PAGE electrophoresis. An 120 kDa product 

is detected only in the cDNA translated in the sense 
orientation. 

C. A plasmid containing the IBl cDNA driven by a CMV 
promoter or its parent vector was transiently transfected 

35 into COS-7 cells and crude cellular extracts (20 /zg) of 

these transfected cells analyzed by Western blotting. 
Using the a-IBl antibodies, an approximately 120kDa protein 
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was detected only in the transfected cells overexpressing 
IBl. 

D. Similar experiments were carried out in transiently 
transfected COS- 7 cells with the pCMV-IBl vector and 

5 cytoplasmic (CE) or nuclear (NE) extracts prepared 4 8 hours 

after transf ection. By Western blot analysis, IBl is 
detected in the cytoplasm and the nucleus of the 
transfected cells. 

E. 2 0 )Ltg of crude cellular extracts from several rat 
10 tissues and of the insulin-secreting (3-cell line )8TC3 were 

analyzed by Western blotting using the a-IBl antibodies. 
IBl protein was detected in the brain and the insulin- 
secreting cell line. 



15 Figfure 4 

Similar DNA-binding activity between the GTII cis element 
of the GLUT2 promoter and the insulin enhancer secruence 
RIPE3 . 

A. Nucleic acid sequences comparison between the murine 
20 GTII cis regulatory element and the rat insulin promoter 

element 3 {RIPE3) . Some sequence identity is depicted and 
correspond, in part, to the RIPESb element. 

B. Southwestern experiments were conducted using INS-1 
nuclear extracts separated by SDS-PAGE gel electrophoresis 

25 and transferred to nitrocellulose membrane subsequently 

incubated with concatanated GTII, GTIII and RIPE3 labelled 
oligonucleotides. No specific binding was detected with 
the GTIII probe whereas the GTII and RIPE3 probes bind to 
an approximately 120 kDa protein and with less specificity 

30 to a factor present at 40 kDa, also detectable with the 

GTIII probe. 

C. Plasmids containing the IBl cDNA driven by a CMV 
promoter (pCMV-IBl) or its parent vector (pCMV) was 
transiently transfected into COS-7 cells and crude cellular 

35 extracts (2 0 /zg) of these transfected cells were analyzed 

by Southwestern with the labeled GTII probe. The 12 0 kDa 
expressed protein is detected only in the transfected cells 
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with the eukaryotic expression vector containing IBl cDNA. 
D. Gel retardation analysis conducted with, the GTII and 
RIPE3 probes using 0TC2 nuclear extracts. The RrPE3- 
binding activity is competed with an 100-fold excess of 
5 unlabelled RIPE3 or GTII oligonucleotides but not with an 

unrelated sequence (GTI) . Conversely, the GTII -binding 
activity is competed with a 1000-fold excess of unlabelled 
RIPE3 oligonucleotides but not with an unrelated sequence 
(GTI) . 

10 

Figure 5 

Activation of the insulin enhancer constructs and the GLUT2 
promoter bv IBl 

A. Two /xg of the eukaryotic expression vector containing 
15 the IBl cDNA (PBKS/IBl) or its parent vector (PBKS) were 

transiently transfected into |3TC3 cells together with 1 /ig 
of the promoterless vector encoding the lucif erase gene 
(pGL3) or -338 bp of the murine GLUT2 promoter cloned into 
the pGL3 vector (-338LUC) or -410 bp of the rat insulin I 

20 promoter similarly cloned into pGL3 (-410LUC) . IBl 

transactivates the GLUT2 promoter (1.6 ± 0.1 (SEM) over 
basal) but is a potent transactivator of the insulin 
promoter (3.8 ± 0.8 (SEM) fold increase over basal). This 
effect is ^-cell specific as it was not observed in a 

25 glucagon-producing cell line (InRl-G9, data not shown) or 

in an unrelated cells (COS-3) , 

B. Several exonuclease III deletions constrxicts of the 
rat insulin promoter linked to a CAT reporter gene were 
transiently transfected into )3TC3 cells in the presence of 

30 PBKS or PBKS/IBI. IBl transact ivated the insulin promoter 

and this effect was maximal within the -316 to -159 bp of 
the gene. Representative study done in duplicate in three 
independent experiments, normalized by protein content. 

C. Five copies of the RIPE3 motif were multimerized 5' of 
35 a SV40 promoter linked to a lucif erase reporter gene. This 

construct was transfected into jSTC3 cells in the presence 
of the PBKS or the PBKS/IBl construct. IBl transact ivated 
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the RIPE3-1UC construct in these cells (2.9 ± 0.3 (SEM) 
fold increase over basal) . Representative experiment done 
12 times, in duplicate and normalized by protein content 
and/or the co-transf ection of a HSK/TK-Renilla luciferase 
5 construct as internal control . 

Figure 6 

Structure and organization of the human IBl gene, the human 
IBl cDNA and the human IBl pseudogene . 

0 The 3' end secjuence of the human IBl cDNA was obtained by 

screening a human insulinoma cDNA library, whereas the 5' 
end was obtained by RACE approach (upper lane) or by 
comparing the genomic sequence {obtained by subcloning a 
BAG clone) with the sequence of the rat IBl cDNA, as 

5 described in the Methods. The human IBl gene contains 12 

exons (black rectangles) separated by 11 introns. The 
functional domains of IBl, as deduced from computer 
analysis, are given within the cDNA rectangle; a denotes 
alpha-helicoidl structure; NTS : nuclear translocation 

0 signal : P-rich : proline rich region ; HLH : helix- loop- 

helix structure ; PID : phosphotyrosine- interaction-domain . 
Sequence of the IBl pseudogene was obtained by sequence 
analysis of part of a 4.8 kh fragment isolated by 
subcloning of a BAG clone. 

5 

Figure 7 

Southern Blot Analysis 

This figure shows Southern blot analysis of the human IBl 
gene. 

0 

Detailed Description 

IBl nucleic acid 

^'IBl nucleic acid" includes a nucleic acid molecule 
which has a nucleotide sequence encoding a polypeptide 
5 which includes the amino acid sequence shown in figure lA, 

ID or IF. 

The IBl coding sequence may be that shown in figure lA 
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or IE, a complementary nucleic acid sequence, or it may be 
a mutant, variant, derivative or allele of these sequences. 
The sequence may differ from that shown by a change which 
is one or more of addition, insertion, deletion and 
5 substitution of one or more nucleotides of the sequence 

shown. Changes to a nucleotide sequence may result in an 
amino acid change at the protein level, or not, as 
determined by the genetic code . 

Thus, nucleic acid according to the present invention 

10 may include a sequence different from the sequence shown in 

figure lA or IE yet encode a polypeptide with the same 
amino acid sequence. The amino acid sequence of the 
coTT^lete rat IBl polypeptide shown in figure lA consists of 
714 amino acids. The complete human IBl cDNA sequence is 

15 set out in figure IE, with the translated amino acid 

sequence consisting of 711 amino acids set out in figure 
IF. Human and rat IBl share a 94% nucleic acid sequence 
identity and a 97% amino acid sequence identity. 

On the other hand, the encoded polypeptide may 

20 comprise an amino acid sequence which differs by one or 

more amino acid residues from the amino acid sequence shown 
in figures lA, ID or IF, Nucleic acid encoding a 
polypeptide which is an amino acid secjuence mutant, 
variant, derivative or allele of the sequence shown in 

25 figures lA, ID or IF is further provided by the present 

invention. Such polypeptides are discussed below. Nucleic 
acid encoding such a polypeptide may show greater than 
about 60% homology with the coding sequence shown in 
figures lA or IE greater than about 70% homology, greater 

3 0 than about 80% homology, greater than about 90% homology, 

greater than about 95% homology, greater than about 98% 
homology, or greater than about 99% homology. 

The present invention also includes fragments of the 
IBl nucleic acid sequences described herein, the fragments 

35 preferably being at least 12, 15, 30, 45, 60, or 12 0 

nucleotides in length. 

Generally, nucleic acid according to the present 
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invention is provided as an isolate, in isolated and/or 
purified fonn, or free or substantially free of material 
with which it is naturally associated, such as free or 
substantially free of nucleic acid flanking the gene in the 
5 human genome, except possibly one or more regulatory 

sequence (s) for expression. Nucleic acid may be wholly or 
partially synthetic and may include genomic DNA, cDNA or 
RNA. Where nucleic acid according to the invention 
includes RNA, reference to the sequence shown should be 

10 construed as reference to the RNA equivalent, with U 

substituted for T. 

Nucleic acid sequences encoding all or part of the IBl 
gene and/or its regulatory elements can be readily prepared 
by the skilled person using the information and references 

15 contained herein and techniques known in the art (for 

example, see Sambrook, Fritsch and Maniatis, ^'Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory 
Press, 1989, and Ausubel et al. Short Protocols in 
Molecular Biology, John Wiley and Sons, 1992) . These 

2 0 techniques include (i) the use of the polymerase chain 

reaction (PCR) to amplify samples of such nucleic acid, 
e,g. from genomic sources, (ii) chemical synthesis, or 
(iii) amplification in JE;. coli . Modifications to the IBl 
sequences can be made, e.g. using site directed 

25 mutagenesis, to provide expression of modified IBl 

polypeptide or to take account of codon preference in the 
host cells used to express the nucleic acid. 

In order to obtain expression of the IBl nucleic acid 
sequences, the sequences can be incorporated in a vector 

30 having control sequences operably linked to the IBl nucleic 

acid to control its expression. The vectors may include 
other sequences such as promoters or enhancers to drive the 
expression of the inserted nucleic acid, nucleic acid 
sequences so that the IBl polypeptide is produced as a 

35 fusion and/or nucleic acid encoding secretion signals so 

that the polypeptide produced in the host cell is secreted 
from the cell. IBl polypeptide can then be obtained by 
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transforming the vectors into host cells in which the 
vector is functional, culturing the host cells so that the 
IBl polypeptide is produced and recovering the IBl 
polypeptide from the host cells or the surrounding medium. 
5 Prokaryotic and eukaryotic cells are used for this purpose 

in the art, including strains of E. coli, yeast, and 
eukaryotic cells such as COS or CHO cells. The choice of 
host cell can be used to control the properties of the IBl 
polypeptide expressed in those cells, e.g. controlling 

10 where the polypeptide is deposited in the host cells or 

affecting properties such as its glycosylation and 
phosphorylation . 

PGR techniques for the amplification of nucleic acid 
are described in US Patent No. 4,683,195. In general, such 

15 techniques require that sequence information from the ends 

of the target sequence is knovm to allow suitable forward 
and reverse oligonucleotide primers to be designed to be 
identical or similar to the polynucleotide sequence that is 
the target for the anplif ication. PGR comprises steps of 

20 denaturation of template nucleic acid (if double-stranded) , 

annealing of primer to target, and polymerisation. The 
nucleic acid probed or used as template in the 
amplification reaction may be genomic DNA, cDNA or RNA. PGR 
can be used to amplify specific sequences from genomic DNA, 

25 specific RNA sequences and cDNA transcribed from mRNA, 

bacteriophage or plasmid sequences. The IBl nucleic acid 
sequences provided herein readily allow the skilled person 
to design PGR primers. References for the general use of 
PGR techniques include Mullis et al, Cold Spring Harbor 

30 Symp. Quant. Biol., 51:263, (1987), Ehrlich (ed) , PGR 

Technology, Stockton Press, NY, 1989, Ehrlich et al. 
Science, 252:1643-1650, (1991), '^PCR protocols; A Guide to 
Methods and Applications", Eds. Innis et al, Academic 
Press, New York, (1990) . 

35 Also included within the scope of the invention are 

antisense oligonucleotide sequences based on the IBl 
nucleic acid sequences described herein, particularly to 
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block the synthesis of IBl in situations where IBl 
overexpression has a deleterious effect, or where it i^s 
desirable to inhibit the ant i-apopt otic effect of IBl, e.g. 
in the treatment of cancer. Antisense oligonucleotides may 
5 be designed to hybridize to the complementary sequence of 

nucleic acid, pre-mRNA or mature mRKA, interfering with the 
production of polypeptide encoded by a given DNA sequence 
(e.g. either native IBl polypeptide or a mutant form 
thereof) , so that its expression is reduce or prevented 

10 altogether. In addition to the IBl coding sequence, 

antisense techniques can be used to target the control 
sequences of the IBl gene, e.g. in the 5' flanking sequence 
of the IBl coding sequence, whereby the antisense 
oligonucleotides can interfere with IBl control sequences. 

15 The construction of antisense sequences and their use is 

described in Peyman and Ulman, Chemical Reviews, 90:543- 
584, (1990), Crooke, Ann. Rev. Pharmacol. Toxicol., 32:329- 
376, (1992), and Zamecnik and Stephenson, P.N.A.S, 75:280- 
284, (1974) , 

2 0 The nucleic acid sequences provided in figures lA and 

IE are useful for identifying nucleic acid of interest (and 
which may be according to the present invention) in a test 
sample. The present invention provides a method of 
obtaining nucleic acid of interest, the method including 

25 hybridization of a probe having the sequence shown in 

figures lA or IE or a complementary sequence, to target 
nucleic acid. 

Hybridization is generally followed by identification 
of successful hybridization and isolation of nucleic acid 

30 which has hybridized to the probe, which may involve one or 

more steps of PGR. 

Nucleic acid according to the present invention is 
obtainable using one or more oligonucleotide probes or 
primers designed to hybridize with one or more fragments of 

35 the nucleic acid sequence shown in figures lA or IE, 

particularly fragments of relatively rare sequence, based 
on codon usage or statistical analysis. A primer designed 
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to hybridize with a fragment of the nucleic acid sequence 
shown in the above figures may ,be used in conjunction with 
one or more oligonucleotides designed to hybridize to a 
sequence in a cloning vector within which target nucleic 
5 acid has been cloned, or in so-called "RACE" (rapid 

amplification of cDNA ends) in which cDNA's in a library 
are ligated to an oligonucleotide linker and PGR is 
performed using a primer which hybridizes with the sequence 
shown in figure lA or IE and a primer which hybridizes to 

10 the oligonucleotide linker. 

Such oligonucleotide probes or primers, as well as the 
full-length sequence (and mutants, alleles, variants and 
derivatives) are also useful in screening a test sample 
containing nucleic acid for the presence of alleles, 

15 mutants and variants, especially those that lead to the 

production of inactive forms of IBl protein, the probes 
hybridizing with a target sequence from a sample obtained 
from the individual being tested. The conditions of the 
hybridization can be controlled to minimise non-specific 

20 binding, and preferably stringent to moderately stringent 

hybridization conditions are preferred. The skilled person 
is readily able to design such probes, label them and 
devise suitable conditions for the hybridization reactions, 
assisted by textbooks such as Sambrook et al (1989) and 

25 Ausubel et al (1992). 

As well as determining the presence of polymorphisms 
or mutations in the IBl sequence, the probes may also be 
used to determine whether mRNA encoding IBl is present in 
a cell or tissue. 

3 0 Nucleic acid isolated and/or purified from one or more 

cells (e.g. human) or a nucleic acid library derived from 
nucleic acid isolated and/or purified from cells (e.g. a 
cDNA library derived from mRNA isolated from the cells) , 
may be probed under conditions for selective hybridization 

35 and/or subjected to a specific nucleic acid amplification 

reaction such as the polymerase chain reaction (PGR) , The 
human IBl gene is located on chromosome 11 (llpll.l2) and 
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contains multiple exons , Polymorphisms within this gene 
may be used as markers for human genetic diseases such as 
diabetes, and neurological disorders including 
neurodegenerative diseases, such as dementia. Parkinsonism, 
5 and Alzheimer's disease; refractory epilepsia (familial 

forms) ; neuronal disabilities such as speech disorders and 
memory alteration; neuronal or glial tumours such as 
neuroblastoma or glioblastoma; autoimmune disease affecting 
the CNS such as systematic lupus erythromatosis; diabetes; 
10 heart diseases such as myocardial infarct and ischemia as 
apoptosis is involved in areas surrounding infarct; and 
brain attack, in particular in the prevention of apoptosis 
in ischemia or infarct . 

In the context of cloning, it may be necessary for one 
15 or more gene fragments to be ligated to generate a full- 
length coding sequence. Also, where a full-length encoding 
nucleic acid molecule has not been obtained, a smaller 
molecule representing part of the full molecule, may be 
used to obtain full-length clones. Inserts may be prepared 
20 from partial cDNA clones and used to screen cDNA libraries. 

The full-length clones isolated may be subcloned into 
expression vectors and activity assayed by transfection 
into suitable host cells, e.g. with a reporter plasmid. 

A method may include hybridization of one or more 
25 (e.g. two) probes or primers to target nucleic acid. Where 

the nucleic acid is double -stranded DNA, hybridization will 
generally be preceded by denaturation to produce single- 
stranded DNA. The hybridization may be as part of a PGR 
procedure, or as part of a probing procedure not involving 
30 PGR. An example procedure would be a combination of PGR 

and low stringency hybridization. A screening procedure, 
chosen from the many available to those skilled in the art, 
is used to identify successful hybridization events and 
isolated hybridized nucleic acid. 
35 Binding of a probe to target nucleic acid (e.g. DNA) 

may be measured using any of a variety of techniques at the 
disposal of those skilled in the art. For instance, probes 
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may be radioactively, f luorescently or enzymatically 
labelled. Other methods not employing labelling of probe 
include examination of restriction fragment length 
polymorphisms, amplification using PGR, RNAse cleavage and 
allele specific oligonucleotide probing. 

Probing may employ the standard Southern blotting 
technique. For instance DNA may be extracted from cells 
and digested with different restriction enzymes . 
Restriction fragments may then be separated by 
electrophoresis on an agarose gel, before denaturation and 
transfer to a nitrocellulose filter. Labelled probe may be 
hybridized to the DNA fragments on the filter and binding 
determined. DNA for probing may be prepared from RNA 
preparations from cells. 

Preliminary experiments may be performed by 
hybridizing under low stringency conditions various probes 
to Southern blots of DNA. digested with restriction enzymes . 
Suitable conditions would be achieved when a large number 
of hybridizing fragments were obtained while the background 
hybridization was low. Using these conditions, nucleic 
acid libraries, e.g. cDNA libraries representative of 
expressed sequences, may be searched. 

Those skilled in the art are well able to employ 
suitable conditions of the desired stringency for selective 
hybridization, taking into account factors such as 
oligonucleotide length and base composition, temperature 
and so on. 

On the basis of amino acid sequence information, 
oligonucleotide probes or primers may be designed, taking 
into account the degeneracy of the genetic code, and where 
appropriate, codon usage of the organism from the candidate 
nucleic acid is derived. An oligonucleotide for use in 
nucleic acid amplification may have about 10 or fewer 
codons {e.g. 6, 7 or 8), i.e. be about 30 or fewer 
nucleotides in length (e.g. 18, 21 or 24) . Generally 
specific primers are upwards of 14 nucleotides in length, 
but not more than 18-20. Those skilled in the art are well 
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versed in the design of primers for use processes such as 
PGR. 

A further aspect of the present invention provides an 
oligonucleotide or polynucleotide fragment of the 
5 nucleotide sequence shown in figures lA or IE, or a 

complementary sequence, in particular for use in a method 
of obtaining and/or screening nucleic acid. The sequences 
referred to above may be modified by addition, 
substitution, insertion or deletion of one or more 
10 nucleotides, but preferably without abolition of ability to 
hybridize selectively with nucleic acid with the sequence 
shown in figures lA or IE, that is wherein the degree of 
homology of the oligonucleotide or polynucleotide with one 
of the sequences given is sufficiently high. 
15 In some preferred embodiments, oligonucleotides 

according to the present invention that are fragments of 
any of the sequences shown in figures lA or IE, or an 
allele thereof, are at least about 10 nucleotides in 
length, more preferably at least about 15 nucleotides in 
20 length, more preferably at least about 20 nucleotides in 

length. Such fragments themselves individually represent 
aspects of the present invention. Fragments and other 
oligonucleotides may be used as primers or probes as 
discussed but may also be generated (e.g. by PGR) in 
25 methods concerned with determining the presence in a test 

sanple of a sequence indicative of susceptibility to one of 
the conditions mentioned above, e.g. diabetes, Parkinsonism 
and/ or dementia. 

Nucleic acid according to the present invention may be 
30 used in methods of gene therapy, for instance in treatment 

of individuals with the aim of preventing or curing (wholly 
or partially) the above mentioned conditions. This too is 
discussed below. 

A convenient way of producing a polypeptide according 
35 to the present invention is to express nucleic acid 

encoding it, by use of the nucleic acid in an expression 
system. The use of expression systems has reached an 
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advanced degree of sophistication. 

Accordingly, the present invention also encompasses a 
method of making a polypeptide (as disclosed) , the method 
including expression from nucleic acid encoding the 
5 polypeptide (generally nucleic acid according to the 

invention) . This may conveniently be achieved by growing 
a host cell in culture, containing such a vector, under 
appropriate conditions which cause or allow expression of 
the polypeptide. Polypeptides may also be expressed in in 

10 vitro systems, such as reticulocyte lysate. 

Systems for cloning and expression of a polypeptide in 
a variety of different host cells are well known. Suitable 
host cells include bacteria, eukaryotic cells such as 
mammalian and yeast, and baculovirus systems. Mammalian 

15 cell lines available in the art for expression of a 

heterologous polypeptide include Chinese hamster ovary 
cells, HeLa cells, baby hamster kidney cells, COS cells and 
many others. A common, preferred bacterial host is E. 
coli . 

20 Suitable vectors can be chosen or constructed, 

containing appropriate regulatory sequences, including 
promoter sequences, terminator fragments, polyadenylation 
sequences, enhancer sequences, marker genes and other 
sequences as appropriate. Vectors may be plasmids, viral 

25 e.g. 'phage, or phagemid, as appropriate. For further 

details see, for example. Molecular Cloning: a Laboratory 
Manual: 2nd edition, Sambrook et al., 1989, Cold Spring 
Harbor Laboratory Press. Many known techniques and 
protocols for manipulation of nucleic acid, for example in 

30 preparation of nucleic acid constructs, mutagenesis, 

sequencing, introduction of DISTA into cells and gene 
expression, and analysis of proteins, are described in 
detail in Current Protocols in Molecular Biology, Ausubel 
et al. eds., John Wiley & Sons, 1992. 

35 Thus, a further aspect of the present invention 

provides a host cell containing nucleic acid as disclosed 
herein. The nucleic acid of the invention may be 
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integrated into the genome (e.g. chromosome) of the host 
cell. Integration may be promoted by inclusion of 
sequences which promote recombination with the genome, in 
accordance with standard techniques. The nucleic acid may 
5 be on an extra- chromosomal vector within the cell. 

A still further aspect provides a method which 
includes introducing the nucleic acid into a host cell. 
The introduction, which may (particularly for in vitro 
introduction) be generally referred to without limitation 

10 as ''transformation", may employ any available technique. 

For eukaryotic cells, suitable techniques may include 
calcium phosphate transf ection, DEAE-Dextran, 
electroporation, liposome -mediated transf ection and 
transduction using retrovirus or other virus, e.g. vaccinia 

15 or, for insect cells, baculovirus. For bacterial cells, 

suitable techniques may include calcium chloride 
transformation, electroporation and transfection using 
bacteriophage. As an alternative, direct injection of the 
nucleic acid could be employed. 

2 0 Marker genes such as antibiotic resistance or 

sensitivity genes may be used in identifying clones 
containing nucleic acid of interest, as is well known in 
the art . 

The introduction may be followed by causing or 
25 allowing expression from the nucleic acid, e.g. by 

culturing host cells (which may include cells actually 
transformed although more likely the cells will be 
descendants of the transformed cells) under conditions for 
expression of the gene, so that the encoded polypeptide is 

3 0 produced. If the polypeptide is expressed coupled to an 

appropriate signal leader peptide it may be secreted from 
the cell into the culture medium. Following production by 
expression, a polypeptide may be isolated and/or purified 
from the host cell and/or culture medium, as the case may 
35 be; and subsequently used as desired, e.g. in the 

formulation of a composition which may include one or more 
additional components, such as a pharmaceutical composition 
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which includes one or more pharmaceutically acceptable 
excipients, vehicles or carriers /e.g. see below) . 

Introduction of nucleic acid may take place in vivo by 
way of gene therapy, as discussed below. 

5 A host cell containing nucleic acid according to the 

present invention, e.g. as a result of introduction of the 
nucleic acid into the cell or into an ancestor of the cell 
and/or genetic alteration of the sequence endogenous to the 
cell or ancestor (which introduction or alteration may take 

10 place in vivo or ex vivo) , may be comprised (e.g. in the 

soma) within an organism which is an animal, particularly 
a mammal, which may be human or non- human, such as rabbit, 
guinea pig, rat, mouse or other rodent, cat, dog, pig, 
sheep, goat, cattle or horse, or which is a bird, such as 

15 a chicken. Genetically modified or transgenic animals or 

birds comprising such a cell are also provided as further 
aspects of the present invention. 

This may have a therapeutic aim. (Gene therapy is 
discussed below.) The presence of a mutant, allele or 

20 variant sequence within cells of an organism, particularly 

when in place of a homologous endogenous sequence, may 
allow the organism to be used as a model in testing and/or 
studying the role of the IBl gene or substances which 
modulate activity of the encoded polypeptide in vitro. 

25 Instead of or as well as being used for the production 

of a polypeptide encoded by a transgene, host cells may be 
used as a nucleic acid factory to replicate the nucleic 
acid of interest in order to generate large amounts of it . 
Multiple copies of nucleic acid of interest may be made 

30 within a cell when coupled to an amplifiable gene such as 

DHFR. Host cells transfomed with nucleic acid of 
interest, or which are descended from host cells into which 
nucleic acid was introduced, may be cultured under suitable 
conditions, e.g. in a fermenter, taken from the culture and 

35 subjected to processing to purify the nucleic acid. 

Following purification, the nucleic acid or one or more 
fragments thereof may be used as desired, for instance in 



wo 98/44106 PCT/GB98/00972 

25 

a diagnostic or prognostic assay as discussed elsewhere 
herein. 



IBl Proteins 

5 The term '*IB-1 biological activity" is herein defined 

as binding to nucleic acid defining the GTII or RIPE3 
promoter elements to cause transcriptional activation of a 
gene under the control of a promoter including those 
elements- Experiments to determine this and other IBl 

10 activities are described in detail below. 

The skilled person can use the techniques described 
herein and others well known in the art to produce large 
amounts of the IBl polypeptide, or fragments or active 
portions thereof, for use as pharmaceuticals, in the 

15 developments of drugs and for further study into its 

properties and role in vivo. 

Thus, a further aspect of the present invention 
provides a polypeptide which has the amino acid sequence 
shown in figure lA, ID or IF, which may be in isolated 

2 0 and/or purified form, free or substantially free of 

material with which it is naturally associated, such as 
other polypeptides or such as human polypeptides other than 
IBl polypeptide or (for example if produced by expression 
in a prokaryotic cell) lacking in native glycosylation, 

25 e.g. unglycosylated. 

Polypeptides which are amino acid sequence variants, 
alleles, derivatives or mutants are also provided by the 
present invention- A polypeptide which is a variant, 
allele, derivative or mutant may have an amino acid 

30 sequence which differs from that given in figure lA, ID or 

IF by one or more of addition, substitution, deletion and 
insertion of one or more amino acids. Preferred 
polypeptides have IBl function, that is to say they 
activate the GLUT2 or insulin promoters, leading to 

35 production of GLUT2 protein or insulin. Preferably, this 

activation is specific to insulin- secreting cells and does 
not take place in non-pacreatic or glucagon producing 
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cells. 

A polypeptide which is an amino acid sequence variant , 
allele, derivative or mutant of the amino acid sequence 
shown in figure lA, ID or IF may comprise an amino acid 
5 sequence which shares greater than about 35% sequence 

identity, greater than about 40%, greater than about 50%, 
greater than about 60%, greater than about 70%, greater 
than about 80%, greater than about 90%, greater than about 
95%, greater than about 97%, greater than about 98% • or 

10 greater than about 99% sequence identity with the amino 

acid sequence shown in figures lA, ID or IF. Sequence 
comparison was made using the GCG program which is 
available from Genetics Computer Group, Oxford Molecular 
Group, Madison, Wisconsin, USA, Version 9.1. Particular 

15 amino acid sequence variants may differ from those shown in 

figures lA, ID or IF by insertion, addition, substitution 
or deletion of 1 amino acid, 2, 3, 4, 5-10, 10-20 20-30, 
30-50, 50-100, 100-150, or more than 150 amino acids. In 
this connection, ''sequence identity" means strict amino 

2 0 acid identity between the sequences being compared . 

However, ''sequence similarity" includes conserved changes 
in amino acid sequence on the basis that such conserved 
changes will not substantially affect the structure and/or 
function of the protein. By way of example, sequence 
25 identity exists when two polypeptides have a Met residue at 

corresponding positions. If the two polypeptides had a Val 
and lie at corresponding positions, there is a high degree 
of similarity as both amino acids have hydrophobic side 
chains. A lower degree of similarity exists between Ser 

3 0 and Ala, while no similarity exists between Gin and Glu. 

Methods of comparing similarity are well known in the art. 

The present invention also includes active portions, 
fragments, derivatives and functional mimetics of the IBl 
polypeptides of the invention. 
3 5 An "active portion" of IBl polypeptide means a peptide 

which is less than said full length IBl polypeptide, but 
which retains at least some of its essential biological 
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activity. For instance, smaller fragments of IBI can act 
as sequestrators or competitive antagonists by interacting 
with other proteins . 

A "fragment" of the IBI polypeptide means a stretch of 
5 amino acid residues of at least about five to seven 

contiguous amino acids, often at least about seven to nine 
contiguous amino acids, typically at least about nine to 13 
contiguous amino acids and, most preferably, at least about 
20 to 30 or more contiguous amino acids. Fragments of the 

10 IBI polypeptide sequence antigenic determinants or epitopes 

useful for raising antibodies to a portion of the IBI amino 
acid sequence. 

A ''derivative" of the IBI polypeptide or a fragment 
thereof means a polypeptide modified by varying the amino 

15 acid sequence of the protein, e.g. by manipulation of the 

nucleic acid encoding the protein or by altering the 
protein itself. Such derivatives of the natural arnino acid 
sequence may involve insertion, addition, deletion or 
substitution of one, two, three, five or more amino acids, 

20 without fundamentally altering the essential activity of 

the wild type IBI polypeptide. 

^'Functional mimetic" means a substance which may not 
contain an active portion of the IBI amino acid sequence, 
and probably is not a peptide at all, but which retains the 

25 essential biological activity of natural IBI polypeptide. 

The design and screening of candidate mimetics is described 
in detail below. 

In some embodiments, the fragments or derivatives 
include the domain from amino acids 566-612 of the sequence 

30 shown in figure lA or the domain from amino acids 563-609 

of the sequence shown in figure IF, or an active portion of 
that domain. 

As shown in figure IC, IBI comprises several distinct 
domains. The 47 amino acid insert contains a putative 
35 helix- loop-helix domain as well as a PID. PID domains have 

an average length of 100-160 amino acids and consist of 
four conserved blocks. The first block of the PID domain 
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of IBl is contained within the 47 amino acid insert. This 
and the neighbouring HLH domain may allow for protein- 
protein interactions, possibly with other members of the 
tyrosine kinase signalling pathway or with transcription 
5 factors. Computer analysis also showed two acidic 

helicoidal structures (aa 31-61 and 114-125 in the rat IBl 
sequence) and a proline rich region (aa 292-366) . Putative 
nuclear localization signals were found at aa 163-19 0 and 
242-270 . 

10 A polypeptide according to the present invention may 

be isolated and/ or purified (e.g. using an antibody) for 
instance after production by expression from encoding 
nucleic acid (for which see below) . Polypeptides according 
to the present invention may also be generated wholly or 

15 partly by chemical synthesis. The isolated and/or purified 

polypeptide may be used in formulation of a composition, 
which may include at least one additional component, for 
example a pharmaceutical composition including a 
pharmaceutically acceptable excipient, vehicle or carrier, 

20 A composition including a polypeptide according to the 

invention may be used in prophylactic and/or therapeutic 
treatment as discussed below. 

A polypeptide, peptide fragment, allele, mutant or 
variant according to the present invention may be used as 

25 an immunogen or otherwise in obtaining specific antibodies. 

Antibodies are useful in purification and other 
manipulation of polypeptides and peptides, diagnostic 
screening and therapeutic contexts. This is discussed 
further below. 

30 A polypeptide according to the present invention may 

be xised in screening for molecules which affect or modulate 
its activity or function. Such molecules may be useful in 
a therapeutic (possibly including prophylactic) context. 

The IBl polypeptides can also be linked to a coupling 

35 partner, e.g. an effector molecule, a label, a drug, a 

toxin and/or a carrier or transport molecule. Techniques 
for .coupling the peptides of the invention to both peptidyl 
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and non-peptidyl coupling partners are well knovm in the 
art. In one embodiment, the carrier molecule is a 16 aa 
peptide sequence derived from the homeodomain of 
Antennapedia (e.g. as sold under the name "Penetratin" ) , 
5 which can be coupled to a peptide via a terminal Cys 

residue. The "Penetratin" molecule and its properties are 
described in WO 91/18981. 

IBl Antagonists 

IBl antagonists include substances which have one or 
more of the following properties: 

(a) substances capable of inhibiting expression of 

IBl ; 

(b) substances capable of reducing the levels of IBl 
present in a target tissue or cell type by binding to and 
neutralizing the IBl; and/or, 

(c) substances capable of counteracting a biological 
property of IBl protein. 

An example of- an IBl antagonist is an antibody capable 
of specifically binding to IBl protein. Anti-IBl 
antibodies and their production are discussed below. 

Anti-IBl Antibodies 

A further important use of the IBl polypeptides is in 
25 raising antibodies that have the property of specifically 

binding to the IBl polypeptides, or fragments or active 
portions thereof. As mentioned above, anti-IBl antiserum 
was prepared against recombinant IBl {amino acids 1-280) . 
These antibodies were raised in a rabbit and are 
30 polyclonal. The antibodies were affinity purified and are 

useful tools as they can recognise IBl epitope (s). These 
and other antibodies that can be made based on the 
disclosure herein can be used as a diagnostic tools and in 
the characterisation of IBl . 
35 It is possible to produce monoclonal antibodies to IBl 

protein and the techniques for doing this are well 
established in the art. Monoclonal antibodies can be 
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subjected to the techniques of recombinant DNA technology 
to produce other antibodies or chimeric molecules which 
retain the specificity of the original antibody. Such 
techniques may involve introducing DNA encoding the 
5 immunoglobulin variable region, or the complementarity 

determining regions (CDRs) , of an antibody to the constant 
regions, or constant regions plus framework regions, of a 
different immunoglobulin. See, for instance, EP-A-184187, 
GB-A- 2188638 or EP-A-23 9400. A hybridoma producing a 

10 monoclonal antibody may be subject to genetic mutation or 

other changes, which may or may not alter the binding 
specificity of antibodies produced. 

The provision of the novel IBl polypeptides enables 
for the first time the production of antibodies able to 

15 bind it specifically. Accordingly, a further aspect of the 

present invention provides an antibody able to bind 
specifically to the polypeptide whose sequence is given in 
figure lA or IF, Such an antibody may be specific in the 
sense of being able to distinguish between the polypeptide 

20 it is able to bind and other human polypeptides for which 

it has no or substantially no binding affinity (e.g. a 
binding affinity of about lOOGx worse) . Specific 
antibodies bind an epitope on the molecule which is either 
not present or is not accessible on other molecules. 

25 Antibodies according to the present invention may be 
specific for the wild-type polypeptide. Antibodies 
according to the invention may be specific for a particular 
mutant, variant, allele or derivative polypeptide as 
between that molecule and the wild-type IBl polypeptide, so 

30 as to be useful in diagnostic and prognostic methods as 

discussed below. Antibodies are also useful in purifying 
the polypeptide or polypeptides to which they bind, e.g. 
following production by recombinant expression from 
encoding nucleic acid. 

35 Preferred antibodies according to the invention are 

isolated, in the sense of being free from contaminants such 
as antibodies able to bind other polypeptides and/or free 
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of serum components. Monoclonal antibodies are preferred 
for some purposes, though polyclonal antibodies are within 
the scope of the present invention. 

/mtibodies may be obtained using techniques which are 
standard in the art. Methods of producing antibodies 
include immunising a mammal (e.g. mouse, rat, rabbit, 
horse, goat, sheep or monkey) with the protein or a 
fragment thereof . Antibodies may be obtained from 
immunised animals using any of a variety of techniques 
known in the art, and screened, preferably using binding of 
antibody to antigen of interest. For instance, Western 
blotting techniques or immunoprecipitation may be used 
(Armitage et al , Nature, 357:80-82, 1992). Isolation of 
antibodies and/or antibody-producing cells from an animal 
may be accompanied by a step of sacrificing the animal . 

As an alternative or supplement to immunising a mammal 
with a peptide, an antibody specific for a protein may be 
obtained from a recombinantly produced library of expressed 
immunoglobulin variable domains, e.g. using lambda 
0 bacteriophage or filamentous bacteriophage which display 

functional immianoglobulin binding domains on their 
surfaces; for instance see WO92/01047. The library may be 
naive, that is constructed from sequences obtained from an 
organism which has not been immunised with any of the 
5 proteins (or fragments) , or may be one constructed using 

sequences obtained from an organism which has been exposed 
to the antigen of interest. 

Antibodies according to the present invention may be 
modified in a number of ways. Indeed the term ^'antibody" 
0 should be construed as covering any binding substance 

having a binding domain with the required specificity. 
Thus the invention covers antibody fragments, derivatives, 
functional equivalents and homologues of antibodies, 
including synthetic molecules and molecules whose shape 
5 mimics that of an antibody enabling it to bind an antigen 

or epitope. 

Example antibody fragments, capable of binding an 
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antigen or other binding partner are the Fab fragment 
consisting of the VL, VH, Cl and CHI domains; the Fd 
fragment consisting of the VH and CHI domains; the Fv 
fragment consisting of the VL and VH domains of a single 
5 arm of an antibody; the dAb fragment which consists of a VH 

domain; isolated CDR regions and F{ab')2 fragments, a 
bivalent fragment including two Fab fragments linked by a 
di sulphide bridge at the hinge region. Single chain Fv 
fragments are also included. 
10 Humanised antibodies in which CDRs from a non-human 

source are grafted onto human framework regions, typically 
with the alteration of some of the framework amino acid 
residues, to provide antibodies which are less immunogenic 
than the parent non-human antibodies, are also included 
15 within the present invention. 

A hybridoma producing a monoclonal antibody according 
to the present invention may be subject to genetic mutation 
or other changes. It will further be understood by those 
skilled in the art that a monoclonal antibody can be 
20 subjected to the techniques of recombinant DNA technology 

to produce other antibodies or chimeric molecules which 
retain the specificity of the original antibody. Such 
techniques may involve introducing DNA encoding the 
immunoglobulin variable region, or the complementarity 
25 determining regions (CDRs) , of an antibody to the constant 

regions, or constant regions plus framework regions, of a 
different immunoglobulin. See, for instance, EP-A-18418 7, 
GB-A-2188638 or EP-A-0239400 . Cloning and expression of 
chimeric antibodies are described in EP-A-0120694 and EP-A- 
30 0125023. 

Hybridomas capable of producing antibody with desired 
binding characteristics are within the scope of the present 
invention, as are host cells, eukaryotic or prokaryotic, 
containing nucleic acid encoding antibodies (including 
35 antibody fragments) and capable of their expression. The 

invention also provides methods of production of the 
antibodies including growing a cell capable of producing 
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the antibody under conditions in which the antibody is 
produced, and preferably secreted. 

The reactivities of antibodies on a sample may be 
determined by any appropriate means. Tagging with 
individual reporter molecules is one possibility. The 
reporter molecules may directly or indirectly generate 
detectable, and preferably measurable, signals. The 
linkage of reporter molecules may be directly or 
indirectly, covalently, e.g. via a peptide bond or non- 
covalently. Linkage via a peptide bond may be as a result 
of recombinant expression of a gene fusion encoding 
antibody and reporter molecule. 

One favoured mode is by covalent linkage of each 
antibody with an individual f luorochrome, phosphor or laser 
exciting dye with spectrally isolated absorption or 
emission characteristics. Suitable f luorochromes include 
fluorescein, rhodamine, phycoerythrin and Texas Red. 
Suitable chromogenic dyes include diaminobenzidine . 

Other reporters include macromolecular colloidal 
particles or particulate material such as latex beads that 
are coloured, magnetic or paramagnetic, and biologically or 
chemically active agents that can directly or indirectly 
cause detectable signals to be visually observed, 
electronically detected or otherwise recorded. These 
molecules may be enzymes which catalyse reactions that 
develop or change colours or cause changes in electrical 
properties, for example. They may be molecularly 
excitable, such that electronic transitions between energy 
states result in characteristic spectral absorptions or 
emissions. They may include chemical entities used in 
conjunction with biosensors. Biotin/avidin or 

biotin/streptavidin and alkaline phosphatase detection 
systems may be employed. 

The mode of determining binding is not a feature of 
the present invention and those skilled in the art are able 
to choose a suitable mode according to their preference and 
general knowledge . 
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Antibodies according to the present invention may be 
used in screening for the presence of a polypeptide, for 
example in a test sample containing cells or cell lysate as 
discussed, and may be used in purifying and/or isolating a 
polypeptide according to the present invention, for 
instance following production of the polypeptide by- 
expression from encoding nucleic acid therefor. Antibodies 
may modulate the activity of the polypeptide to which they 
bind and so, if that polypeptide has a deleterious effect 
in an individual, may be useful in a therapeutic context 
(which may include prophylaxis) . 

An antibody may be provided in a kit, which may 
include instructions for use of the antibody, e.g. in 
determining the presence of a particular substance in a 
test sample. One or more other reagents may be included, 
such as labelling molecules, buffer solutions, elutants and 
so on. Reagents may be provided within containers which 
protect them from the external environment , such as a 
sealed vial. 

Diagnostic Methods 

A nimber of methods are known in the art for analysing 
biological samples from individuals to determine whether 
the individual carries a IBl allele predisposing them to 
may be used as markers for human genetic diseases such as 
diabetes; neurological disorders including 

neurodegenerative diseases, such as dementia. Parkinsonism, 
and Alzheimer's disease; refractory epilepsia (familial 
forms) ; neuronal disabilities such as speech disorders and 
memory alteration; neuronal or glial tumours such as 
neuroblastoma or glioblastoma; autoimmune disease affecting 
the CNS such as systematic lupus erythromatosis (SLE) ; 
diabetes; heart diseases such as myocardial infarct and 
ischemia as apoptosis is involved in areas surrounding 
infarct; and brain attack, in particular in the prevention 
of apoptosis in ischemia or infarct . The purpose of such 
analysis may be used for diagnosis or prognosis, to assist 
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a physician in determining the severity or likely course of 
the condition and/or to optimise treatment of it. 
Alternatively, the methods can be used to detect IBl 
alleles that are statistically associated with a 
susceptibility to these conditions in the future, e.g. 
identifying individuals who would benefit from regular 
screening or prophylactic treatment. 

Broadly, the methods divide into those screening for 
the presence of IBl nucleic acid sequences and those that 
rely on detecting the presence or absence of the IBl 
polypeptide. The methods make use of biological samples 
from individuals that are suspected of contain the nucleic 
acid sequences or polypeptide. Examples of biological 
samples include blood, plasma, serum, tissue samples, 
tumour samples, saliva and urine. 

Exemplary approaches for detecting IBl nucleic acid or 
polypeptides include : 

(a) comparing the sequence of nucleic acid in the 
sample with the IBl nucleic acid sequence to determine 
whether the sanple from the patient contains mutations; or, 

(b) determining the presence in a sample from a 
patient of the polypeptide encoded by the IBl gene and, if 
present, determining whether the polypeptide is full 
length, and/or is mutated, and/or is expressed at the 
normal level; or, 

(c) using DNA fingerprinting to compare the 
restriction pattern produced when a restriction enzyme cuts 
a sample of nucleic acid from the patient with the 
restriction pattern obtained from normal IBl gene or from 
known mutations thereof; or, 

(d) using a specific binding member capable of binding 
to a IBl nucleic acid sequence (either a normal sequence or 
a known mutated sequence) , the specific binding member 
comprising nucleic acid hybridisable with the IBl sequence, 
or substances comprising an antibody domain with 
specificity for a native or mutated IBl nucleic acid 
sequence or the polypeptide encoded by it, the specific 
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binding meniber being labelled so that binding of the 
specific binding member to its binding partner is 
detectable; or, 

(e) using PGR involving one or more primers based on 
normal or mutated IBl gene sequence to screen for normal or 
mutant IBl gene in a sample from a patient. 

A ^'specific binding pair" corrprises a specific binding 
member (sbm) and a binding partner (bp) which have a 
particular specificity for each other and which in normal 
conditions bind to each other in preference to other 
molecules. Exanples of specific binding pairs are antigens 
and antibodies, molecules and receptors and complementary 
nucleotide sequences. The skilled person will be able to 
think of many other examples and they do not need to be 
listed here. Further, the term "specific binding pair" is 
also applicable where either or both of the specific 
binding member and the binding partner comprise a part of 
a larger molecule. In embodiments in which the specific 
binding pair are nucleic acid sequences, they will be of a 
length to hybridise to each other under the conditions of 
the assay, preferably greater than 10 nucleotides long, 
more preferably greater than 15 or 20 nucleotides long. 

In most embodiments for screening for IBl 
susceptibility alleles, the IBl nucleic acid in the sample 
will initially be amplified, e.g. using PGR, to increase 
the amount of the analyte as compared to other sequences 
present in the sample. This allows the target sequences to 
be detected with a high degree of sensitivity if they are 
present in the sarrple. This initial step may be avoided by 
using highly sensitive array techniques that are becoming 
increasingly important in the art . 

A variant form of the IBl gene may contain one or more 
insertions, deletions, substitutions and/ or additions of 
one or more nucleotides compared with the wild- type 
sequence which may or may not disrupt the gene function. 
Differences at the nucleic acid level are not necessarily 
reflected by a difference in the amino acid sequence of the 
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encoded polypeptide, but may be linked to a known 
dysfunction. However, a mutation or other difference in a 
gene may result in a frame-shift or stop codon, which could 
seriously affect the nature of the polypeptide produced (if 
5 any) , or a point mutation or gross mutational change to the 

encoded polypeptide, including insertion, deletion, 
substitution and/or addition of one or more amino acids or 
regions in the polypeptide. A mutation in a promoter 
sequence or other regulatory region may prevent or reduce 

10 expression from the gene or affect the processing or 

stability of the mRNA transcript. 

There are various methods for determining the presence 
or absence in a test sample of a particular nucleic acid 
sequence, such as the sequence shown in figure lA or IE or 

15 a mutant, variant or allele thereof. Exemplary tests 

include nucleotide sequencing, hybridization using nucleic 
acid immobilized on chips, molecular phenotype tests, 
protein truncation tests (PTT) , single-strand conformation 
polymorphism (SSCP) tests, mismatch cleavage detection and 

2 0 denaturing gradient gel electrophoresis (DGGE) . These 

techniques and their advantages and disadvantages are 
reviewed in Nature Biotechnology, 15:422-426, 1997. 

Tests may be carried out on preparations containing 
genomic DNA, cDNA and/or mRNA. Testing cDNA or mRNA has 

25 the advantage of the complexity of the nucleic acid being 

reduced by the absence of intron sequences, but the 
possible disadvantage of extra time and effort being 
required in making the preparations. RNA is more difficult 
to manipulate than DNA because of the wide -spread 

30 occurrence of RNAses. 

Nucleic acid in a test sample may be sequenced and the 
sequence compared with the sequence shown in figure lA or 
IE, to determine whether or not a difference is present. 
If so, the difference can be compared with known 

35 susceptibility alleles to determine whether the test 

nucleic acid contains one or more of the variations 
indicated. 
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Since it will not generally be time- or labour- 
efficient to sequence all nucleic acid in a test sample or 
even the whole IBl gene, a specific amplification reaction 
such as PGR using one or more pairs of primers may be 
employed to amplify the region of interest in the nucleic 
acid, for instance the IBl gene or a particular region in 
which mutations associated with a susceptibility to one of 
the conditions mentioned above. The amplified nucleic 
acid may then be sequenced as above, and/or tested in any 
other way to determine the presence or absence of a 
particular feature. Nucleic acid for testing may be 
prepared from nucleic acid removed from cells or in a 
library using a variety of other techniques such as 
restriction enzyme digest and electrophoresis. 

Nucleic acid may be screened using a variant- or 
allele-specif ic probe. Such a probe corresponds in 
sequence to a region of the IBl gene, or its complement, 
containing a sequence alteration known to be associated 
with a susceptibility to the conditions mentioned above. 
Under suitably stringent conditions, specific hybridisation 
of such a probe to test nucleic acid is indicative of the 
presence of the sequence alteration in the test nucleic 
acid. For efficient screening purposes, more than one 
probe may be used on the same test sample. 

Allele- or variant-specific oligonucleotides may 
similarly be used in PGR to specifically amplify particular 
sequences if present in a test sample. Assessment of 
whether a PGR band contains a gene variant may be carried 
out in a number of ways familiar to those skilled in the 
art. The PGR product may for instance be treated in a way 
that enables one to display the mutation or polymorphism on 
a denaturing polyacrylamide DNA sequencing gel, with 
specific bands that are linked to the gene variants being 
selected. 

An alternative or supplement to looking for the 
presence of variant sequences in a test sample is to look 
for the presence of the normal sequence, e.g. using a 
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suitably specific oligonucleotide probe or primer. 

Use of oligonucleotide probes and primers has been 
discussed in more detail above. 

Approaches which rely on hybridisation between a probe 
5 and test nucleic acid and subsequent detection of a 

mismatch may be employed. Under appropriate conditions 
(temperature, pH etc.), an oligonucleotide probe will 
hybridise with a sequence which is not entirely 
complementary. The degree of base-pairing between the two 

10 molecules will be sufficient for them to anneal despite a 
mis-match. Various approaches are well known in the art 
for detecting the presence of a mis -match between two 
annealing nucleic acid molecules. 

For instance, RNAse A cleaves at the site of a mis- 

15 match. Cleavage can be detected by electrophoresing test 

nucleic acid to which the relevant probe or probe has 
annealed and looking for smaller molecules (i.e. molecules 
with higher electrophoretic mobility) than the full length 
probe/ test hybrid. Other approaches rely on the use of 

20 enzymes such as resol vases or endonucleases . 

ThuS/ an oligonucleotide probe that has the sequence 
of a region of the normal IBl gene (either sense or anti- 
sense strand) in which mutations are known to occur may be 
annealed to test nucleic acid and the presence or absence 

25 of a mis-match determined. Detection of the presence of a 

mis-match may indicate the presence in the test nucleic 
acid of a mutation. On the other hand, an oligonucleotide 
probe that has the sequence of a region of the IBl gene 
including a mutation may be annealed to test nucleic acid 

30 and the presence or absence of a mis-match determined. The 

absence of a mis-match may indicate that the nucleic acid 
in the test sample has the normal sequence. In either 
case, a battery of probes to different regions of the gene 
may be employed. 

35 The presence of differences in sequence of nucleic 

acid molecules may be detected by means of restriction 
enzyme digestion, such as in a method of DNA fingerprinting 
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where the restriction pattern produced when one or more 
restriction enzymes are used to cut a sample of nucleic 
acid is compared with the pattern obtained when a sample 
containing the normal gene or a variant or allele is 
5 digested with the same enzyme or enzymes. 

The presence or the absence of an important regulatory 
element in a promoter or other regulatory sequence located 
in introns may also be assessed by determining the level of 
raRNA production by transcription or the level of 

10 polypeptide production by translation from the mRNA. 

A test sample of nucleic acid may be provided for 
example by extracting nucleic acid from cells, e.g. in 
saliva or preferably blood, or for pre-natal testing from 
the amnion, placenta or foetus itself. 

15 There are various methods for determining the presence 

or absence in a test sample of a particular polypeptide, 
such as the polypeptide with the amino acid sequence shown 
in figure lA, ID or IF or an amino acid sequence mutant, 
variant or allele thereof. 

20 A sample may be tested for the presence of a binding 

partner for a specific binding member such as an antibody 
(or mixture of antibodies) , specific for one or more 
particular variants of the polypeptide shown in figure lA, 
ID, or IF. 

25 A sample may be tested for the presence of a binding 

partner for a specific binding member such as an antibody 
(or mixture of antibodies) , specific for the polypeptide 
shown in figure lA, ID or IF. 

In such cases, the sample may be tested by being 

3 0 contacted with a specific binding member such as an 

antibody under appropriate conditions for specific binding, 
before binding is determined, for instance using a reporter 
system as discussed. Where a panel of antibodies is used, 
different reporting labels may be employed for each 

35 antibody so that binding of each can be determined. 

A specific binding member such as an antibody may be 
used to isolate and/or purify its binding partner 
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polypeptide from a test sample, to allow for sequence 
and/or biochemical analysis of the polypeptide to determine 
whether it has the sequence and/or properties of the 
polypeptide whose sequence is shown in figure lA or IF, or 
5 if it is a mutant or variant form. Amino acid sequence is 

routine in the art using automated sequencing machines. 

Therapeutics 

Pharmaceuticals and Peptide Therapies 
10 IBl polypeptides and antagonists and agonists may be 

useful in the treatment of a wide range of disorders 
because of the biological activities of IBl. Broadly the 
conditions that can be treated fall into four main areas, 
diabetes, neurological disorders, cancer and in the 
15 inhibition or promotion of apoptosis. 

For the treatment of diabetes, IBl or an IBl agonist 
can be used to elevate the insulin content and secretion in 
a patient and/or to elevate GLUT2 expression and therefore 
the sensitivity of a patient to glucose. Further, the 
20 ant i-apopt otic effects of IBl can be used to inhibit the 

apoptosis of |8-cells which occurs when jS cell exhaustion 
occurs in the type II diabetes. IBl could also be used to 
inhibit the apoptosis seen during the autoimmune 
destruction in type I diabetes. IBl can also help to 
25 improve cell survival when |8 cells are engineered for 

xenotransplantation or encapsulation, e.g. in a 
biocompatible polymer. 

In the treatment of neurological disorders, IBl 
antagonists can be used to promote cell death by reducing 
30 or removing the anti-apoptotic effects of IBl. This could 

have applications in the treatment of brain tumours such as 
neuroblastomas and glioblastomas, and in the treatment of 
refractory epilepsy. On the other hand, IBl or IBl 
agonists could be used to promote cell survival in neurons 
35 and consequently be useful therapies for neurodegenerative 

disorders, ischemic diseases, autoimmune diseases of the 
CNS, infarct of the brain to promote cell survival in cells 
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surrounding the infarct, Parkinsonism and to promote cell 
survival in sections of the spine. 

In the treatment of cancer, IBl antagonists can be 
used to promote cell death in tumour cells in cancers in 
5 which the cells do not undergo an apoptotic program. In 

surrounding, healthy tissues, IBl or IBl agonists could be 
used help protect the cells from the effects of cancer 
treatments. In these embodiments, clearly some form of 
selective delivery of the agents is preferred, 
10 From the above applications, IBl and IBl agonists have 

a general role in the protection if cells from apoptosis, 
while conversely IBl antagonists can be used to promote 
cell death. 

The IBl polypeptides, antagonists (e.g. antibodies), 

15 peptides and nucleic acid of the invention can be 

formulated in pharmaceutical compositions. These 
compositions may comprise, in addition to one of the above 
substances, a pharmaceutical ly acceptable excipient, 
carrier, buffer, stabiliser or other materials well known 

20 to those skilled in the art. Such materials should be non- 

toxic and should not interfere with the efficacy of the 
active ingredient. The precise nature of the carrier or 
other material may depend on the route of administration, 
e.g. oral, intravenous, cutaneous or subcutaneous, nasal, 
. 25 intramuscular, intraperitoneal routes. 

Pharmaceutical compositions for oral administration 
may be in tablet, capsule, powder or liquid form. A tablet 
may include a solid carrier such as gelatin or an adjuvant. 
Liquid pharmaceutical compositions generally include a 

3 0 liquid carrier such as water, petroleum, animal or 

vegetable oils, mineral oil or synthetic oil. 
Physiological saline solution, dextrose or other saccharide 
solution or glycols such as ethylene glycol, propylene 
glycol or polyethylene glycol may be included. 

3 5 For intravenous, cutaneous or subcutaneous injection, 

or injection at the site of affliction, the active 
ingredient will be in the form of a parenterally acceptable 
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aqueous solution which is pyrogen-free and has suitable pH, 
isotonicity and stability. Those of relevant skill in the 
art are well able to prepare suitable solutions using, for 
example, isotonic vehicles such as Sodium Chloride 
5 Injection, Ringer's Injection, Lactated Ringer's Injection. 

Preservatives, stabilisers, buffers, antioxidants and/or 
other additives may be included, as required. 

Whether it is a polypeptide, antibody, peptide, 
nucleic acid molecule, small molecule or other 

10 pharmaceutical ly useful compound according to the present 

invention that is to be given to an individual , 
administration is preferably in a "prophylactically 
effective amount" or a "therapeutically effective amount" 
(as the case may be, although prophylaxis may be considered 

15 therapy) , this being sufficient to show benefit to the 

individual. The actual amount administered, and rate and 
time- course of administration, will depend on the nature 
and severity of what is being treated* Prescription of 
treatment, e.g. decisions on dosage etc, is within the 

20 responsibility of general practitioners and other medical 

doctors, and typically takes account of the disorder to be 
treated, the condition of the individual patient, the site 
of delivery, the method of administration and other factors 
known to practitioners. Examples of the techniques and 

25 protocols mentioned above can be found in Remington's 
Pharmaceutical Sciences, 16th edition, Osol, A. (ed) , 1980. 

Alternatively, targeting therapies may be used to 
deliver the active agent more specifically to certain types 
of cell, by the use of targeting systems such as antibody 

30 or cell specific ligands . Targeting may be desirable for 

a variety of reasons; for example if the agent is 
unacceptably toxic, or if it would otherwise require too 
high a dosage, or if it would not otherwise be able to 
enter the target cells. 

35 Instead of administering these agents directly, they 

could be produced in the target cells by expression from an 
encoding gene introduced into the cells, eg in a viral 
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vector (a variant of the VDEPT technique - see below) . 
The vector could be targeted to the specific cells to be 
treated, or it could contain regulatory elements which are 
switched on more or less selectively by the target cells. 

Alternatively, the agent could be administered in a 
precursor form, for conversion to the active form by an 
activating agent produced in, or targeted to, the cells to 
be treated. This type of approach is sometimes known as 
ADEPT or VDEPT; the former involving targeting the 
activating agent to the cells by conjugation to a cell- 
specific antibody, while the latter involves producing the 
activating agent, e.g. an enzyme, in a vector by expression 
from encoding DNA in a viral vector (see for example, EP-A- 
415731 and WO90/07936) - 

A composition may be administered alone or in 
combination with other treatments, either simultaneously or 
sequentially, dependent upon the condition to be treated. 

Methods of Gene Therapy 

As a further alternative, the nucleic acid encoded the 
authentic biologically active IBl polypeptide could be used 
in a method of gene therapy, to treat a patient who is 
unable to synthesize the active polypeptide or unable to 
synthesize it at the normal level, thereby providing the 
effect provided by wild-type IBl and suppressing the 
occurrence of diabetes in the target cells. 

Vectors such as viral vectors have been used in the 
prior art to introduce genes into a wide variety of 
different target cells. Typically the vectors are exposed 
to the target cells so that transfection can take place in 
a sufficient proportion of the cells to provide a useful 
therapeutic or prophylactic effect from the expression of 
the desired polypeptide. The transfected nucleic acid may 
be permanently incorporated into the genome of each of the 
targeted tumour cells, providing long lasting effect, or 
alternatively the treatment may have to be repeated 
periodically. 
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A variety of vectors ; both viral vectors and plasmid 
vectors, are known in the art, see US Patent No. 5,252,4 79 
and WO93/07282. In particular, a nuinber of viruses have 
been used as gene transfer vectors, including 
papovaviruses, such as SV40, vaccinia virus, herpesviruses, 
including HSV and EBV, and retroviruses. Many gene therapy 
protocols in the prior art have used disabled murine 
retroviruses . 

As an alternative to the use of viral vectors other 
known methods of introducing nucleic acid into cells 
includes electroporation, calcium phosphate co- 
precipitation, mechanical techniques such as 
microinjection, transfer mediated by liposomes and direct 
DNA uptake and receptor-mediated DNA transfer . 

As mentioned above, the aim of gene therapy using 
nucleic acid encoding the IBl polypeptide, or an active 
portion thereof, is to increase the amount of the 
expression product of the nucleic acid in cells in which 
the level of the wild-type IBl polypeptide is absent or 
present only at reduced levels. Target cells for gene 
therapy include insulin secreting jS- cells or any neuron 
derived cells. Cell engineering can be used to provide the 
overexpression or repression of IBl in transfected cell 
lines which can then be subsequently transplanted to 
humans. Gene therapy can be employed using a promoter to 
drive IBl expression in a tissue specific manner (i.e. an 
insulin promoter linked to IBl cDNA will overexpress IBl in 
)S-cells and transiently in the brain) . If defective 
function of IBl is involved in neurological disease, IBl 
can be over expressed in transformed cell lines for 
transplantation. 

Gene transfer techniques which selectively target the 
IBl nucleic acid to target tissues are preferred. Examples 
of this included receptor-mediated gene transfer, in which 
the nucleic acid is linked to a protein ligand via 
polylysine, with the ligand being specific for a receptor 
present on the surface of the target cells. 
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IBl expressing vectors (used in several vectors such 
as adenovirus or retrovirus could be used if stably 
integrated as an anti-apoptotic gene product. This allows 
cells to survive the stressful event of encapsulation for 
5 xenotransplantation, and/ or to survive hypoxia if 
transplanted into a low oxygen environment such as the 
peritoneum. The cells can then survive and, if needed, 
secrete IBl. The IBl could be useful in the treatment or 
prevention of conditions described above, e.g. to promote 
10 cell survival if secreted into the central nervous system 

in the treatment of neurodegenerative disease. 

Antisense technology based on the IBl nucleic acid 
sequences is discussed above . 

15 Methods of Screening for Drugs 

A polypeptide according to the present invention may 
be used in screening for molecules which affect or modulate 
its activity or function. Such molecules may be useful in 
a therapeutic (possibly including prophylactic) context. 
20 It is well known that pharmaceutical research leading 

to the identification of a new drug may involve the 
screening of very large numbers of candidate substances, 
both before and even after a lead compoxind has been found. 
This is one factor which makes pharmaceutical research very 
25 expensive and time-consuming. Means for assisting in the 

screening process can have considerable commercial 
inportance and utility. 

A method of screening for a substance which modulates 
activity of a polypeptide may include contacting one or 
3 0 more test substances with the polypeptide in a suitable 

reaction medium, testing the activity of the treated 
polypeptide and comparing that activity with the activity 
of the polypeptide in comparable reaction medium untreated 
with the test substance or substances . A difference in 
35 activity between the treated and untreated polypeptides is 

indicative of a modulating effect of the relevant test 
substance or substances . 
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Combinatorial library technology provides an efficient 
way of testing a potentially vast nutnber of different 
substances for ability to modulate activity of a 
polypeptide. Such libraries and their use are known in the 
art. The use of peptide libraries is preferred. 

Prior to or as well as being screened for modulation 
of activity, test sxibstances may be screened for ability to 
interact with the polypeptide / e.g. in a yeast two-hybrid 
system (which requires that both the polypeptide and the 
test substance can be expressed in yeast from encoding 
nucleic acid) . This may be used as a coarse screen prior 
to testing a substance for actual ability to modulate 
activity of the polypeptide. Alternatively, the screen 
could be used to screen test substances for binding to a 
TBI specific binding partner, to find mimetics of the IBl 
polypeptide, e.g. for testing as therapeutics . 

In one embodiment, the present invention provides a 
method for screening test substances for a biological 
activity of an IBl polypeptide, the method employing a cell 
0 line which does not express IBl polypeptide, the cell line 

being transformed with an expression vector comprising a 
reporter gene under the specific control of GTII promoter, 
the method comprising exposing cell line to a test 
substance and determining whether expression of the 
5 reporter gene occurs. Thus, the binding of a test 

substance to the GTII promoter element causes a signal from 
the reporter (e.g. a colour change or the expression of a 
detectable product) . 

Following identification of a substance which 
0 modulates or affects polypeptide activity, the substance 

may be investigated further. Furthermore, it may be 
manufactured and/or used in preparation, i.e. manufacture 
or formulation, of a composition such as a medicament, 
pharmaceutical composition or drug. These may be 
S administered to individuals. 

Thus, the present invention extends in various aspects 
not only to a substance identified using a nucleic acid 
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accordance with what is disclosed herein, but also a 
pharmaceutical composition, medicament, drug or other 
composition comprising such a substance, a method 
5 comprising administration of such a composition to a 

patient, e.g. for treatment {which may include preventative 
treatment) of diabetes, use of such a substance in 
manufacture of a cotrposition for administration, and a 
method of making a pharmaceutical composition comprising 
10 admixing such a substance with a pharmaceutically 

acceptable excipient, vehicle or carrier, and optionally 
other ingredients . 

A substance identified using as a modulator of 
polypeptide function may be peptide or non-peptide in 
15 nature. Non-peptide "small molecules" are often preferred 

for many in vivo pharmaceutical uses. Accordingly, a 
mimetic or mimic of the substance (particularly if a 
peptide) may be designed for pharmaceutical use. 

The designing of mimetics to a known pharmaceutically 
20 active compound is a known approach to the development of 

pharmaceuticals based on a ''lead" compound. This might be 
desirable where the active compound is difficult or 
expensive to synthesise or where it is unsuitable for a 
particular method of administration, e.g. peptides are 
25 unsuitable active agents for oral compositions as they tend 

to be quickly degraded by proteases in the alimentary 
canal. Mimetic design, synthesis and testing is generally 
used to avoid randomly screening large number of molecules 
for a target property. 
30 There are several steps commonly taken in the design 

of a mimetic from a compound having a given target 
property. Firstly, the particular parts of the compound 
that are critical and/or important in determining the 
target property are determined. In the case of a peptide, 
35 this can be done by systematically varying the amino acid 

residues in the peptide, e.g. by substituting each residue 
in turn. Alanine scans of peptide are commonly used to 
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refine such peptide motifs. These parts or residues 
constituting the active region of the compound are known as 
its '"pharmacophore" . 

Once the pharmacophore has been found, its structure 
is modelled to according its physical properties, eg 
stereochemistry, bonding, size and/or charge, using data 
from a range of sources, e.g. spectroscopic techniques, X- 
ray diffraction data and NMR. Computational analysis, 
similarity mapping (which models the charge and/or volume 
of a pharmacophore, rather than the bonding between atoms) 
and other techniques can be used in this modelling process. 

In a variant of this approach, the three-dimensional 
structure of the ligand and its binding partner are 
modelled. This can be especially useful where the ligand 
and/or binding partner change conformation on binding, 
allowing the model to take account of this in the design of 
the mimetic. 

A template molecule is then selected onto which 
chemical groups which mimic the pharmacophore can be 
grafted. The template molecule and the chemical groups 
grafted on to it can conveniently be selected so that the 
mimetic is easy to synthesise, is likely to be 
pharmacologically acceptable, and does not degrade in vivo, 
while retaining the biological activity of the lead 
compound. Alternatively, where the mimetic is peptide 
based, further stability can be achieved by cyclising the 
peptide, increasing its rigidity. The mimetic or mimetics 
found by this approach can then be screened to see whether 
they have the target property, or to what extent they 
0 exhibit it. Further optimisation or modification can then 

be carried out to arrive at one or more final mimetics for 
in vivo or clinical testing. 

Screening for Substances Affecting IBl Expression 
5 The present invention also provides the use of all or 

part of the nucleic acid sequence of the IBl promoter in 
methods of screening for substances which modulate the 
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activity of the promoter and increase or decrease the level 
of IBl expression. 

"Promoter activity" is used to refer to ability to 
initiate transcription. The level of promoter activity is 
5 quantifiable for instance by assessment of the amount of 

mRNA produced by transcription from the promoter or by 
assessment of the amount of protein product produced by 
translation of mRNA produced by transcription from the 
promoter. The amount of a specific mRNA present in an 

10 expression system may be determined for example using 
specific oligonucleotides which are able to hybridise with 
the mRNA and which are labelled or may be used in a 
specific amplification reaction such as the polymerase 
chain reaction. Use of a reporter gene facilitates 

15 determination of promoter activity by reference to protein 

production. 

Further provided by the present invention is a nucleic 
acid construct comprising a IBl promoter region set out in 
figure 5 or a fragment, mutant, allele, derivative or 

20 variant thereof able to promoter transcription, operably 

linked to a heterologous gene, e.g. a coding sequence- A 
'"heterologous" or "exogenous" gene is generally not a 
modified form of IBl Generally, the gene may be 
transcribed into mRNA which may be translated into a 

25 peptide or polypeptide product which may be detected and 
preferably quant itated following expression. A gene whose 
encoded product may be assayed following expression is 
termed a ''reporter gene", i.e. a gene which "reports" on 
promoter activity. 

30 The reporter gene preferably encodes an enzyme which 

catalyses a reaction which produces a detectable signal, 
preferably a visually detectable signal, such as a coloured 
product. Many examples are known, including /3- 

galactosidase and luciferase. (8-galactosidase activity may 

35 be assayed by production of blue colour on substrate, the 

assay being by eye or by use of a spectrophotometer to 
measure absorbance. Fluorescence, for example that 
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produced as a result of luciferase activity, may be 
quantitated using a spectrophotometer. Radioactive assays 
may be used, for instance using chloramphenicol 
acetyltransf erase, which may also be used in non- 
5 radioactive assays. The presence and/or amount of gene 

product resulting from expression from the reporter gene 
may be determined using a molecule able to bind the 
product , such as an antibody or fragment thereof . The 
binding molecule may be labelled directly or indirectly 
0 using any standard technique. 

Those skilled in the art are well aware of a multitude 
of possible reporter genes and assay techniques which may 
be used to determine gene activity. Any suitable 
reporter/assay may be used and it should be appreciated 
5 that no particular choice is essential to or a limitation 

of the present invention. 

Nucleic acid constructs comprising a promoter (as 
disclosed herein) and a heterologous gene (reporter) may be 
employed in screening for a substance able to modulate 
0 activity of the promoter. For therapeutic purposes, e.g. 

for treatment of diabetes a substance able to up-regulate 
expression of the promoter directing the expression of 
normal may be sought. Alternatively, siibstances to down- 
regulate the promoter may help to prevent or inhibit the 
5 production of mutated IBl polypeptide, if this is an agent 

implicated in the development of diabetes. A method of 
screening for ability of a substance to modulate activity 
of a promoter may comprise contacting an expression system, 
such as a host cell, containing a nucleic acid construct as 
0 herein disclosed with a test or candidate substance and 

determining expression of the heterologous gene. 

The level of expression in the presence of the test 
substance may be compared with the level of expression in 
the absence of the test substance. A difference in 
5 expression in the presence of the test substance indicates 

ability of the substance to modulate gene expression. An 
increase in expression of the heterologous gene compared 
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with expression of another gene not linked to a promoter as 
disclosed herein indicates specificity of the substance for 
modulation of the promoter. 

A promoter construct may be introduced into a cell 
line using any technique previously described to produce a 
stable cell line containing the reporter construct 
integrated into the genome. The cells may be grown and 
incubated with test compounds for varying times. The cells 
may be grown in 96 well plates to facilitate the analysis 
of large numbers of compounds. The cells may then be 
washed and the reporter gene expression analysed. For some 
reporters, such as lucif erase the cells will be lysed then 
analysed. 

Following identification of a substance which 
5 modulates or affects promoter activity, the substance may 

be investigated further. Furthermore, it may be 

manufactured and/or used in preparation, i.e. manufacture 
or formulation, of a composition such as a medicament, 
pharmaceutical composition or drug. These may be 
0 administered to individuals. 

Materials and Methods 

Construction of an INS- 1 cDNA. expression library and 
cloning of the IBl cDNA. 

5 An oligo (dT) -primed cDNA was generated from 10 (ig of 

poly (A)* RNA obtained from the differentiated INS-1 insulin 
secreting cell line using a cDNA synthesis kit (Stratagene, 
La Jolla, CA) according to the manufacturer's instructions. 
The cDNAs were cloned into the EcoRI and Xhol sites of the 

0 LambdaZap Express expression vector (Stratagene, La Jolla, 

Ca) . A total of 2 X 10^ colonies were screened by the 
procedure described by Singh et al (3 8) using as probe 
concatanated GTII oligonucleotides. 5 /xg of double 
stranded GTII oligonucleotides 

5 ( 5 ' - GTATy^GGGTGTATTGATTGGATTACCATCAATACTCAGCTTCT- 3 ' ) were 

filled in by the Klenow fragment of DNA polymerase I in the 
presence of (a"P) deoxycytosine- triphosphate and the free 
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nucleotides separated through a G-50 spun column. These 
labelled oligonucleotides were subsequently ligated to 
generate the concatanated probe. The expression cloning 
was performed exactly as previously described for the 
5 Southwestern experiments, except that we used 10 ug/ml of 
single stranded DNA in place of poly dl/dC in a total 
reaction volume of 250 ml (3). One IBl positive clone was 
obtained from the screening and the cDNA sequenced in both 
5' and 3' orientations . 

10 

Cell lines - 

The transplantable X-ray induced rat insulinoma INS-1 
cell line was provided by Asfari et al . and grown as 
described (1) . The mouse insulin-producing ^TC3 cell line, 
15 the hamster glucagon-producing InRl-G9 cells and the 

kidney- derived COS- 7 cell line were cultured in RPMl 1640 
medium containing 10% fetal calf serum, 2 mM L-glutamine, 
100 U/ml penicillin and 100 mg/ml streptomycin (9 ,4 0) , 

20 Plasmid constructions, transient transf ections , CAT and 

lucif erase assays. 

The eukaryotic expression vector encoding IBl was 
constructed by inserting the IBl cDNA in the Nhel/Xhol 
sites of the CMV-driven plasmid PBKS (Stratagene, La Jolla, 

25 CA) to generate the PBKS -IBl vector. PCR mutagenesis was 

used to add a FLAG epitope (Kodak, New Haeven,CT) just C- 
terminal of the initiating methionine of the IBl cDNA in 
the PBKS-IBl construct. Dideoxy sequencing of the 
resulting plasmids was used to confirm the correct sequence 

30 of the clones thus obtained. The -410 bp of the rat 

insulin II promoter and the -338 bp of the murine GLUT2 
promoter were cloned 5' of the lucif erase gene of the 
pGL3Basic vector (Promega, Madison, WI) . The RIPE3 double- 
stranded oligonucleotides 

35 ( 5 - GATCTGGAAACTGCAGCTTCAGCCCCTCTGGCCATCTGCTGATCCG- 3 ' ) were 

multimerized, filled-in by Klenow, and blunt-end ligated 
into the Smal site of the SV4 0 early minimal promoter 
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linked to a lucif erase gene (pGL3 promoter, Promega) . The 
five copies of the RIPE3 element cloned into pGL3promoter 
were sequenced. The -316, -289, -254, -188, -159 and -66 
bp of the rat I insulin promoter linked to the CAT reporter 
5 gene were kindly provided by Jacques Philippe from the 

University Medical School of Geneva. 

All constructs were transiently transfected using the 
cationic reagent DOTAP in solution as recommended by the 
supplier (Boehringer Mannheim) . Two jug of the reporter 

10 DNAs (luciferase or CAT constructs) with 1 /zg of 
transactivator (PBKS or PBK/IBl) were used for 1-2 X 10^ 
cells and incubated for 48-56 hours. In some transfection 
experiments, co- transfection with 500 ng of the herpes 
simplex virus thymidine kinase (HSV-TK) promoter driving 

15 the Renilla luciferase gene (Promega, Madison, WI) were 

included in order to control the transfection efficiency. 
The transfected cells were harvested with Promega lysis 
buffer, the cellular debris removed, and the supernatant 
collected. Protein concentrations were determined using 

20 the Bio-Rad protein assay (Bio-Rad Laboratories, Richmond, 

CA) . Luciferase activities were measured twice with 50 to 
100 fig of protein extracts from each transfected plate 
according to the protocol of Brasier et al{4) . 

25 Gel shift assays and Southwestern experiments. 

Nuclear and cytoplasmic extracts were prepared 
according to the method of Dent and Latchmann (7) . The 
sequences of the oligonucleotides GTII and RIPE3 are 
described above. Complementary sense and antisense 

3 0 oligonucleotides were hybridized and then filled in by the 

Klenow fragment of DNA polymerase I in the presence of 
(a32P) deoxycytosine triphosphate. The end-labelled probe 
was incubated with 2 /ig nuclear extracts exactly as 
described previously for the band shift assays (3) . The 

3 5 Southwestern experiments were conducted as described with 

the modifications detailed above (3) . The in vitz-o 
translation experiments were conducted using the coupled 
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translation-traduction kit (TNT) from Promega (Madison, MI) 
and according to the manufacturer' s instructions in the 
presence of (^^S) methionine. The labelled protein was 
resolved by PAGE on a 10% SDS- containing gel. 

5 

RNA and Northern Blot Analysis. 

The RMA isolation and Northern blot analysis from rat 
tissues or cell lines were conducted exactly as previously 
described (3) . The rat pancreatic islets were isolated by 
10 the method of Gotoh et al (15) . 

Preparation of antisera. 

Anti-IBl antiserum was prepared using a cDNA" fragment 
encoding the first 280 amino acids of the protein. This 

15 fragment was inserted into the His -tagged pQE-9 expression 

vector (Quiagen, Basel, Switzerland) , expressed and 
subsequently purified through a Ni^"- containing column 
following instructions from the manufacturer (Quiagen, 
Basel, Switzerland) and used to elicit polyclonal 

20 antibodies in rabbits. To affinity-purified the 

antibodies, the Ni^*- column purified 1-2 80 aa of the 
recombinant protein was immobilized onto a nitrocellulose 
membrane and the rabbit serum purified by several steps of 
incubation with this membrane in PBS (phosphate-buf f ered 

25 saline) buffer and elution in 0.2M Tris -glycine at pH 2.8, 

followed by neutralization to pH -7.5, 

Immunohistochemistry . 

Adult mice under deep anesthesia were perfused with 

30 PBS and pancreata were quickly removed, incubated for 6 

hours in 4% paraformaldehyde and processed for paraffin 
embedding. Sections of 8/im were used for 
immxinocytochemistry . In brief, sections were incubated in 
the presence of 5% H2O2 for 3 0 min at room temperature, 

35 washed in PBS and incubated overnight in the presence of 5% 

bovine serum albumin dissolved in PBS buffer containing 
0.5% Triton X-100 and 2% goat serum. The sections were 
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then incubated for 14 hours at 4°C with the affinity- 
purified preimmune or immune anti IBl serum (dilution 
1/200) and then for 2 hours at room temperature with the 
secondary antibody (biotinylated goat anti -rabbit IgG from 
5 Vector/Reactolab S.A.) and 2 hours at room temperature with 

avidin-biotinylated peroxidase complex (ABC, Vector 
Reactolab SA) . The peroxidase reaction was finally 
visualized with 3,3' -diaminobenzidine tetrahydrochloride 
dihydrate (Fluka, Buchs, Switzerland) in PBS containing 

10 0.2% H2O2. Photographs were taken with a Leica microscope 

using transmitted light optics and Kodak Ektachrome EPJ 32 0 
films. For immunostaining, the /STC3 or the transfected 
COS-7 cells, these cells were plated on glass coverslips 48 
hours before the experiment. The cells were fixed in ice- 

15 cold methanol /acetone (50:50) and processed as described 

above with the l^BC detection kit. In COS-7 cells, the 
detection of the FLAG epitope was obtained using as a 
secondary antibody, the commercially available (Kodak, New 
Haeven,CT) monoclonal mouse antibody directed against the 

2 0 FLAG subsequently visualized with fluorescein 

isothiocyanate (FITC) -labelled goat anti-mouse antibodies. 
The rabbit anti -IBl antibodies were detected using a Texas 
red- labelled goat anti -rabbit antibody. The fluorescence 
images were obtained with 1600 ASA Kodak films. 

25 

Sequence of human IBl cDNA. 

A radiolafaelled rat IBl cDNA was initially used to 
screen at low stringency a human insulinoma cDNA library. 
Briefly, poly(A)+RNA was isolated using the Promega 

3 0 PolyA+RNA purification kit (Promega, Madison, WI) and a 

cDNA library was constructed using the X Zapp II Express 
expression vector (Stratagene, La Jolla, CA) according to 
the manufacturer's instructions. One positive clone was 
subjected to sequence analysis which revealed that the 1.8- 
35 kb insert was homologous to nucleotides 144 0 to 22 52 of rat 

IBl cDNA. Two 5' RACE reactions (Boehringer-Mannheim, 
Mannheim, Germany) were used to obtain part of the 5' 
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sequence of the human IBl cDNA. The first step in these 
two reactions was an RT-PCR amplification on total RNA 
isolated from a human glucagonoma . RT-PCR amplifications 
was performed using an oligo-dT- anchor primer (provided in 
5 the kit) and as antisense primers oligonucleotides RACE-1 

or -2, respectively (Table 1). Two consecutive PCR 
amplification steps were then performed on the RT-PCR 
product using the PCR anchor primer (provided in the kit) 
and as antisense primers the nested oligonucleotides RACE- 3 

10 and -4 (for the first reaction) or RACE- 5 and -6 (for the 

second reaction) . The first RACE reaction yielded the 
sequence corresponding to nucleotides 70 to 2 90 whereas the 
second RACE reaction gave sequences 480 to 711 of human IBl 
cDNA. The remaining sequences of human IBl cDNA 

15 (nucleotides 1 to 69, 291 to 479 and 712 to 1439) were 

deduced by comparing the sequence of the rat IBl cDNA to 
genomic sequences obtained from a bacterial artificial 
chromosome (BAC) library of human genomic DNA (see below) . 

20 Screening of a BAC library of human genomic DNA and 

sequence analysis of BAC clones. 

A BAC library of human genomic DNA (Research Genetics, 
Rockville, MD) was screened using as a probe the 1.8-kb 
fragment corresponding to the 3' end of the human IBl cDNA, 

25 as described above. Positive BAC clones were selected, 

purified according to the instructions of the manufacturer 
and digested with BamHl/Bgrlll for analysis by Southern 
blotting according to standard protocols. Genomic DNA 
extracted from circulating leukocytes of an apparently 

30 healthy individual was used for comparison. 

Fragments generated by BamHl/Bglll digest of two 
positive BAC clones were subcloned within a KS=/- 
Bluescript vector. These subclones were screened using two 
non-overlapping probes, the 1.8kb fragment corresponding to 

35 the 3' end of human IBl cDNA, as described above, or a 955- 

bp fragment corresponding to the 5' end of rat IBl cDNA. 
Positive clones were identified and subjected to sequence 
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analysis using T3 and T7 oligonucleotides. A walking 
approach was used to obtain intervening sequences . 
Finally, two pairs of oligonucleotides (IB1-787F and IB- 
1326R; IB1-1346F and IB1-1602R) designed based on the 
5 sequence of the human IBl cDNA were used for PCR- 

amplif ication using the BAG clone or Bluescript subclones 
as templates, PGR amplified fragments were either cloned 
within the pGEM-T Easy plasmid factor (Promega) or 
sequenced directly. 

10 

Fluorescent in-situ hybridization (FISH) . 

FISH analysis was performed using as probes the 
products of BAC-A or BAC-B clones digested with 
BarriHl/Bglll, respectively. An aliquot of fresh peripheral 

15 blood cells was washed twice in PBS and cultured for 2-3 

days in RPMI 1640 containing 2 0% fetal calf serum. Cells 
were incubated with colcemid, treated with a hypotonic KCl 
solution and fixed with an acetic acid: methanol (1:3) 
solution. Ghromosome slides were denatured in a solution 

20 containing 70% formamide in 2xSSC at 75 °C for 2 min. A 

total of -2 50 ng of DNA were labelled by nick- translation 
incorporating biotin-dUTP (Boehringer-Mannheim) and then 
treated with DNase I (Pharmacia, Uppsala, Sweden) at a 
concentration of 50 pg//xl for 20 min at room temperature. 

25 DNase I was inactivated at 70 °C for 10 min. Labelled probe 

were denatured at 37°C for 10 min in 25 /xl Hybridsol VII 
(Oncor) , allowed to reanneal to human Cotl DNA (3 fig) at 
37°G for 15 to 30 min, and hybridized overnight. The 
hybridization signals were detected using FITC-avidin or 

30 rhodamine- conjugated anti-digoxigenin monoclonal antibodies 

(Boehringer-Mannheim) . 

Radiation hybrid mapping. 

Radiation hybrid mapping was used to precisely map the 
35 JB2 gene. In the first series of experiments, PGR 

amplification was performed using one pair of 
oligonucleotides (IB1-1346F and IB1-1602R) were originally 
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designed according to the sequence of human IBl cDNA 
(nucle9tides 134 6 to 1602) . Subsequently, a second set of 
oligonucleotides (IBl-f and IBl-r) were designed. 
Amplification was performed in 96 -well plates 
5 (Polyfiltronics, Rockland, MA) . Each reaction contained 

25ng DNA, 0 , 3miyi of each primer, 0.2mM dNTP's, 0.2 unit 
AmpliTaq polymerase and IX PGR buffer (Perkin- Elmer, 
Norwalk, CT) (5% DMSO was also added when using the second 
set of primers) in a total volume of 20 /zl , The PGR 

10 program consisted of a 2 min (4 min for the second set of 

primers) initial denaturation at 94 'C followed by 35 cycles 
of 94'C for 30 sec (1 min), 58*C (62"C) for 1 min, 72'C for 
1 min and a final extension of 3 min (4 min) at 72*G. 
Amplicons were scored after electrophoresis through a 3% 

15 (w/v) 3:1 NuSieve agarose gel (FMC BioProducts, Rockland 

ME). The IBl STS's were evaluated by somatic cell and 
radiation hybrid on both the MIGMS Human/Rodent somaticcell 
hybrid Mapping Panel #2 (Cornell Institute for Medical 
Research, Camden NJ) and the PCRable DMA Monochromosomal 

20 Somatic Cell Hybrid Panel (BIOS, New Haven CT) . The 

Stanford 3 (G3) and Genebridge 4 (GB4) radiation hybrid 
panels (Research Genetics) were used to more precisely 
localize the genes. The vectors from G3 and BG4 panels 
were analyzed using the RHMAP RH2PT Version 3.0 (57) and 

25 the RHMAPPER Version 1.1 (58) respectively. 

Northern blot analysis of IBl expression in human tissues. 

A total of 2^g of poly(A)+RNA isolated from various 
human tissues (Clontech) were hybridized at 42 'C with the 
30 ^^P-labelled l.Skb probe corresponding to the 3' end of the 

human IBl cDNA (as described above) in a BxSSC buffer 
continuing 0.1 M NaPO^, 10 mmol/1 EDTA, 50% (v/V) formamide, 
1% (w/v) SDS, 5x Denhardt's and 0.1 mg/ml tRNA. After 
washing, the filter was exposed. 



35 
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Results 

Example 1 Isolation and sequence analysis of the IBl cDNA. 

The identification of a cis element named GTII located 
in the proximal ' region of the GLUT2 promoter which is 
5 functionally important to confer pancreatic expression of 

the gene was the initial step towards the identification of 
putative islet-specific trans-acting factors (3) . 
Therefore, the inventors constructed a library using INS-1 
cells as GTII -binding activity is restricted to insulin- 

10 secreting cells and its abundance was highest in a cell 

line that expresses high amounts of endogenous GLUT 2 . A 
poly-dT primed INS-1 cDNA expression libary was therefore 
constructed and screened by the procedure described by 
Singh et al (38) using as a probe concatanated GTII 

15 oligonucleotides. One positive clone was isolated from a 

primary screen of approximatively 2 x 10^ phage plaques. 
The 2,990 bp long insert encodes a large open reading frame 
of 714 amino acids. The gene product contained in this 
clone was subsequently termed IBl for Islet-Brain 1 as its 

2 0 expression is mainly restricted to these two tissues, as 

discussed below. The deduced amino acid sequence revealed 
the presence of a putative HLH dimerization domain 
conserved with other members of the bHLH family (Figure 1) . 
Using the SOPMA. algorithm (self optimized prediction method 

25 from alignements, OsTRS, Lyon, France) , computer analysis 

predicts two acidic helicoidal structures (a. a. 31-61 and 
114-125) and a proline-rich (a. a. 292-366) region in the 
amino-terminal part of the protein that could act as 
transactivation domains (25) . Putative nuclear 

30 localisation signals were also recognized (a, a. 163-190 and 

242-270) (8) . 

Example 2 Tissue distribution of IBl expression. 

Northern blot analysis of total RNA. and polyA" RNA from 

3 5 several adult rat tissues and cell lines indicated that IBl 

is abundantly expressed as a 3 and 3.2 kb transcript in 
several insulin- secreting cell lines (INS-1 and RIN5F) , in 
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isolated pancreatic islets and in the brain (Figure 2) . 
IBl transcripts were also detected, although to a lower 
extent, in the kidney and the heart. In this latter, a 
single 3.2 kb transcript was detected. In the brain, IBl 
5 expression is highest in the cortex and in the pituitary 

gland, although the IBl transcripts were also detected in 
the hypothalamus, the cerebellum and the medulla (Figure 
2D) . In the isolated pancreatic islets, IBl expression was 
not regulated by increasing the glucose concentration in 

10 the incubation medium from 2,8 mM to 30 mM suggesting that 

the gene is not transcriptionally regulated by glucose as 
this is the case for GLUT2 (52) . IBl expression is 
therefore restricted to a few tissues and, importantly, the 
IBl mRNA is not detected in the liver where GLUT2 is 

15 abundantly expressed. 

Example 3 IBl is immunodetected in rat tissues and in 
insulin- SGcrGting cell lines. 

Antibodies were raised against the bacterial ly- 

20 produced N-terminal part of IBl (aa 1-280) . These 

polyclonal antibodies were affinity purified as described 
in Materials and Methods and used in Western blotting. As 
shown in Figure 3A, these antibodies detect a 12 0 kDa 
protein in )8TC3 nuclear extracts which comigrate with the 

25 product obtained by in vitro translating the IBl cDNA in 

presence of ^^S methionine (Figure 3B) . Similarly, the IBl 
protein could be detected in both the nuclear and 
cytoplasmic extracts obtained from COS- 7 cells transiently 
transfected with the CMV driven IBl cDNA, which suggests 

30 that IBl is actively translocated in the nucleus (Figure 3C 

and D) . A survey of several rat tissues confirmed that the 
IBl transcripts expressed in brain are translated into 
immimodetec table protein once analyzed by Western blotting 
{Figure 3E) , 



Example 4 Iimxmostaininff analysis of IBl expression. 

To gain further information regarding the tissue and 
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cellular localization of IBl within the pancreas, 
immunohistochendstry studies were performed on mouse islets 
and /?TC3 cells. Using the affinity-purified antibodies 
directed against IBl, this factor was detected in the 
5 pancreatic islet, with a staining which differs from the 

one obtained with the anti-GLUT2 or the anti -insulin 
antibodies. The imtnunostaining reaction is negative in 
PTC2 cells incubated with preimmune serum whereas the 
signal was positive in the nuclei and the cytoplasm of the 

10 same cells exposed to the anti- IBl serum. In order to 

confirm the specificity of the anti- IBl antibodies in 
immunocytochemistry, a construct was generated which 
includes a FLAG epitope N-terminal to the IBl protein 
expressed \inder the control of a CMV promoter. This 

15 construct was transiently transfected into COS- 7 cells, 

immunodetected with an anti -FLAG antibody subsequently 
visualized by FITC-staining or with the anti-IBl antibody 
subsequently visualized with an anti-rabbit Texas red- 
labeled antibody. The IBl protein was detected in 

20 transfected COS-7 cells with both the anti-FLAG and the 

anti-IBl antibodies confirming that the protein is, at 
least in part, correctly translocated to the nuclei of COS- 
7 cells and that the anti-IBl antibody is specific to IBl. 

25 Example 5 The IBl protein binds specifically to GTII and to 

the insulin enhancer sequence RIPE3. 

By analogy with PDX-1 which was shown to be an 
homeodomain transcription factor and which is expressed 
specifically in the pancreatic islets and is able to 

30 control several genes expressed only in these cells through 

homologous DNA sequences (22,24,28,53,54), we hypothesized 
that IBl could similarly control several genes within the 
/?- cells. The GTII cis sequence used for the expression 
cloning of IBl shares some nucleic acid sequences identity 

35 with an inportant enhancer sequence of the insulin promoter 

termed RIPE3 for rat insulin promoter element 3. This 
RIPE3 element was previously shown to participate in the (S- 
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cell specific control of the insulin gene (17,20). As 
depicted in Figure 4A, some conserved nucleic acid 
sequences are present between GTII and RIPE3 . 

By Southwestern analysis of INS-1 nuclear extracts, 
5 both the GTII and RIPE3 probes detected a 120 kDa protein 

which is similar in size to the product obtained by in 
vitro translating the IBl cDNA in presence of ^^S methionine 
and to the protein detected using anti-IBl antibodies 
(Figure 4B) . Furthermore, the IBl cDNA was cloned 3' to 

10 a CMV promoter and transiently transfected into COS- 7, a 

cell line lacking endogenous IBl. Crude cellular extracts 
were subsequently analyzed by the Southwestern technique 
using the GTII probe. Only IBl -transfected COS-7 cells 
express the expected 120 kDa GTII-binding protein (Figure 

15 4C) . The IBl cDNA is therefore translated into a 12 0 kDa 

product which is able to bind the GTII probe in a manner 
similar to the endogenous binding activity observed in INS- 
1 or i3TC3 nuclear extracts. 

Gel retardation analysis were also conducted using 

20 either the GTII or the RIPE3 elements as probe with pTC3 

nuclear extracts. A shown in Figure 4D, the GTII-binding 
activity is competed with an excess of cold RIPE3 and 
inversely, the RIPE3 binding activity is competed with an 
excess of cold GTII nucleotides. Therefore, similar DNA- 

25 binding activities present in insulin secreting cells 

interact with both the GTII and the RIPE3 regulatory 
sequences. This suggests that IBl, which binds GTII and 
RIPE3, may regulates expression of the GLUT2 and insulin 
genes. Constructs were then generated that include only 

30 the amino (amino acids 1-280) or the COOH- terminal part 

(280-714) of the protein, and bacterially produced 
recombinant IBl proteins were obtained from these plasmids. 
The 280-714aa protein, but not the l-280aa protein, was 
able to bind the GTII cis sequence when tested by 

35 Southwestern analysis implying that the carboxy end of the 

protein contains the DNA-binding domain of IBl. 
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Example 6 Transcriptional activation hy JBl. 

Transcriptional activation by IBl was assayed by 
cotransf action experiments in the insulin- secreting cell 
line )3TC3 and in a glucagon producing cell line (InRl-G9) 
5 as well as the non pancreatic-derived cell line.COS-7 cells 

with an IBl expression vector (PBK/IBl) and several 
reporter contracts (Figure 5A) . The wild- type insulin I 
promoter region (-410 bp) and the proximal region of the 
GLUT2 promoter (-338 bp) were linked to a lucif erase 

10 reporter gene and transiently transfected in these cell 

lines. Overexpression of IBl transact ivated the GLUT2 
promoter 1.7 fold whereas the insulin gene was induced 3.8 
fold when compared to a co-transfection with the expression 
vector lacking the IBl cDNA (PBKS) . This effect was 

15 restricted to the insulin-secreting cells and clearly 

absent with the promoterless reporter construct (pGL3 ) . 
The restricted action of IBl in )8-cells suggest that IBl 
functions only in the presence of other regulators present 
in pancreatic jS-cells, possibly others yet-to-be-identified 

2 0 iS-cell specific transcription factors. Several 5' deletion 

constructs of the rat insulin I promoter were then 
similarly transfected into iSTC3 cells in the presence or 
the absence of the expression vector encoding IBl. This 
latter transactivated several exonuclease III -deleted 

25 regions of the insulin promoter, although this effect was 

lower once constructs 3' to the -159 bp were used (Figure 
SB). This indicates that the RIPE3 sequence located 5' to 
the -159 bp of the promoter could be an important 
regulatory element through which IBl transactivates the 

30 insulin gene (13) . For the next series of experiments we 

addressed the question whether IBl could mediate its 
stimulatory effect through the RIPE3 enhancer sequence of 
the rat insulin II gene. This enhancer sequence has been 
extensively studied by several investigators . Two separate 

35 subelements of RIPE3, RIPE3a and RIPE3b, function 

cooperatively to generate maximal activity of this tissue- 
specific enhancer (17) . Each subelement binds ubiquitously 
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expressed or 0-cell specific trans-acting factors such as 
BETAl, BETA2 and E47 (26,27,34,37). Most interestingly, 
the BETA2/E47 heterodimer has been shown to be a potent 
transactivator of the RIPE3 elements, effect mediated 
through the E box located within the RIPE3a subelement 
(27) . However, this stimulatory effect necessitate the 
RIPE3b adjacent subelement where putative jS-cell specific 
trans-acting factor (s) may bind to optimise the stimulatory 
effect the HLH factors. To investigate if IBI is one of 
the possible partners of the RIPE3 binding factors, 5 
copies of the RIPE3 element were cloned 5' to a minimal 
heterologous promoter (SV40) linked to a lucif erase gene. 
This construct or its parent vector (SV40 luciferase) were 
transfected into iSTC3 cells with or without the expression 
15 vector encoding IBI. As shown in Figure 5C, IBI 

transactivated the RIPE3 construct 5-fold and this effect 
was not present in the glucagon producing cell line InRl-G9 
(data not shown) . Taken together, these results confirm 
that IBI is a component of the RIPE3 binding factors as 
evaluated by Southwestern analyses and cross -competition 
assays with the RIPE3 and GTII elements (Figure 5) and as 
evaluated by the functional data obtained with the RIPE3 
construct (Figure 5) . 

25 Example 7 Sequence of human IBI cDNA and comparison witzh 

rat IBI CDNA. 

Screening of a human insulinoma cDNA library using as 
a probe the rat IBI cDNA yielded the 3' end of sequence of 
human IBI cDNA. This sequence started at position 

30 corresponding to nucleotide 1324 of rat IBI cDNA. Part of 

the 5' end of human IBI cDNA was obtained by a RACE and RT- 
PCR performed on total RNA isolated from a human 
glucagonoma, as described in the Methods, whereas the 
remaining 5' end sequence was deduced by comparison of 

35 genomic sequences with the rat IBI cDNA, Human IBI cDNA 

comprises a total of 2136 nucleotides (Figure 1) . A 141 -bp 
sequence corresponding to a 47 amino-acid motif which 
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distinguishes IBl from JIP-i was present in the cDNA clone 
and is given in bold. Comparison at the amino acid level 
of the deduced human (upper lane) and rat (lower lane) IBl 
sequences is given in Table 2. Overall, there is a 95% 
similarity and 94% identity between rat and human IBl. Rat 
IBl CDNA comprised two ATG codons separated by an 18 -bp 
sequence, so that the initiating methionine was ambiguous. 
Human IBl cDNA only contains one unambiguous ATG codon 
which is flanked in 3' by a large open reading frame (ORF) 
homologous to nucleotides 21 to 122 of the rat IBl cDNA. 
As compared to rat IBl, human IBl contains five inserted 
residues (in positions 7, lis, 142 and 267), whereas one 
amino acid is lacking between residues 79 and 8 0 so that, 
altogether, human IBl contains four more residues than rat 
1^ 3:bi. It is worth noting the fact that apparently 

functionally important sequences, as deduced from computer 
analysis of rat IBl sequence are highly conserved between 
rat and human IBl. Finally, 46 amino acids out of the 47- 
amino acid residues fragments which distinguishes IBl from 
JIP-1 were identical between rat and human IBl, Taken 
together, these data show that the IBl gene is expressed in 
human insulinoma. In addition, the high degree of identity 
at the amino-acid level between rat and human IBl suggests 
that IBl may play an important biological role. 



20 



25 



Exan^le 8 Southern hlot analysis of the hximan IBl ^ene. 

A BAG library of human genomic DMA was screened using 
as a probe a l.Skb fragment corresponding to the 3' end of 
human IBl cDNA. Six strongly positive clones were 

30 selected. Southern blot analysis was performed on 

BamHl/Sgrlll digested BAG clones using this same probe. Two 
different patterns were observed. One clone, BAG- A, 
displayed four bands of apparent size -2.0, -1.6, -1.3 and 
-O.akb (Figure 7, lane 2), respectively. In contrast, BAG- 

35 B (lane 3) showed the presence of a strong 4.Bkb band and 

two additional larger faint bands. The prominent 4.8kb 
band visible in BAC-B was also present in the remaining 



wo 98/44106 



PCT/GB98/00972 



67 

four selected BAG clones (data not shown) , The banding 
patterns obtained with BAC-A and BAQ-B clones were compared 
to the one observed when analyzing BarriRl / Bglll digested 
human genomic DNA (lane 1) . A total of seven bands were 
5 visible in human genomic DNA. All the bands identified in 

human genomic DNA were accounted for by the bands visible 
when analyzing BAC-A and BAC-B, In particular, the 
prominent 4.8kb band and two larger faint bands were 
similar in size to the bands detected in BAC-B (lane 3), 

10 whereas the four smaller faint bands visible in human 

genomic DNA were comparable in size to the four bands 
visible in BAC-A (lane 2> . The filter was stripped and the 
a 955-bp fragment corresponding to the 5' end of rat Jhl 
cDNA was used as a probe. A total of three bands of 

15 apparent sizes 6.0, 1.5 and 1.4kb, respectively, were 

visible for BAC-A, whereas the strong 4 . 8kb band previously 
identified when using the 3' probe was the only band 
visible for BAC-B (data not shown) . Taken together, these 
data indicate that 1) BAC-A is likely to include the multi- 

20 exonic IBl gene: 2) the size of the IBl gene approximates 

14kb; 3) BAC-B contains an homologue of the IBl gene or an 
JSI pseudogene; 4) the presence of additional JBl 
homologues or pseudogenes in the human genome is unlikely; 
in particular, our data do not provide any evidence for the 

25 presence in human genome of a gene other than the IBl gene 

that would encode JIP-1. 

Example 9 Stjruciziure of t±e human XB2 g^ene. 

Fragments generated by digestion of BAC-A with 

30 BcuriHl/Bgrlll were cloned into a Bluescript phagemid vector 

and these clones were screened using two non-overlapping 
probes, a 955-bp fragment corresponding to the 5' end of 
rat IBl cDNA or the 1.8-kb fragment corresponding to the 3' 
end of human JBI cDNA. Positive clones were subjected to 

35 sequence analysis. This analysis, completed by a PCR-based 

approach for regions of the gene that were not present in 
the Bluescript subclones, revealed the sequence of the 
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coding region of the JBl gene and its structural 
organization (Figure 6) . The JBl gene contains 12 exons 
and 11 introns. The size of the exons ranges from 71 (axon 
10) to 813 (exon 5) nucleotides. A complete identity was 
5 observed between the available nucleic acid sequence of 

human IBl cDNA (obtained as described above) and sequences 
of the coding regions of the IBl gene. The position of the 
intron-exon junctions are indicated in Figure 6 by a 
bullet. It is worth noting that the sequence encoding the 

10 47 amino-acid motif which distinguishes IBl from JIP-1 

(which is indicated by a bar in Figure 6) spans over two 
adjacent exons (exons 8 and 9) . Sequence analysis of the 
intron-exons junctions (Table 3) revealed that all splice 
sites obeyed to the GT-AG paradigm. These data indicate 

15 that 1) IBl is encoded by a multi-exonic gene contained 

within the BAC-A clone and 2) the JIP-1 isoform may be an 
alternative spliced variant of the IBl gene; however, the 
intron-exon structure of the JBI gene renders this 
possibility unlikely. 

20 

Example 10 Sequence of the 4.8kb fragment of BAC-B clone. 

This analysis disclosed a high degree of homology 
between this genomic fragment and human IBl cDNA. In 
particular, a 737 -bp ORF which was 97% homologous to 

25 nucleotides 1297 to 2133 of human IBl cDNA was identified. 

An additional 657 bp ORF was identified which included a 
4B8-bp sequence homologous to nucleotides 228 to 716 of 
human IBl cDNA. However, numerous stop codons were 
identified between the ORF's, As a first step in 

30 determining whether these ORFs were expressed in eukaryotic 

cells, the Bluescript vector containing the 4.8 kb fragment 
was transfected into COS-1 cells. No immunoreactive 
material was identified in transfected COS-1 cells (data 
not shown) , a finding consistent with the ORF being not 

35 expressed in COS-1 cells. The same result was obtained 

when the 4.8kb fragment was excised and inserted within a 
CMV-driven plasmid transfected in the same way (data not 
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shown) . Altogether, these data strongly suggest that this 
IBl cDNA-related genomic sequence corresponds to an IBl 
pseudogene. The nature of the two larger bands identified 
in Southern blot analysis of human genomic DNA and BAC-B 
5 was not determined. The fact that these two bands were 

only detected when using the 1.8kb probe from the 3' end of 
the IBl cDNA suggests that they correspond to sequences 
which share some homology with the 3 ' end of human IBl 
CDNA. 

10 

Exaxnple 11 Chromosomal localization of the IBl gene and tzhe 
IBl pseudogene . 

Fluorescent in-situ hybridization (FISH) was performed 
to map the JBI gene and the IBl pseudogene. Fragments 
15 generated by BamHl/Bglll digestion of BAC-A and BAC-B were 

labelled and used as probes BAC-A (corresponding to the IBl 
gene) and BAC-B (corresponding to the JBI pseudogene) 
probes. The JBI gene and the JBI pseudogene are located on 
separate chromosomes. The JBI gene is positioned close to 

2 0 the centromere on the short arm of chromosome 11, whereas 

the IBl pseudogene maps to the long arm of chromosome 17 . 
The chromosomal localizations were confirmed when using 
centromere -specific markers for chromosome 11 and 
chromosome 1 7 . 

25 Next, fine mapping of the JBI gene and JBI pseudogene 

was performed using radiation hybrid mapping. This 
analysis was initially carried out using one pair of 
oligonucleotides designed using the sequence of human IBl 
cDNA. These primers amplified a -260 bp fragment in the 

3 0 radiation hybrid mapping system corresponding to sequences 

1346 to 1602 of human cDNA. This fragment corresponds to 
sequences from the JBI pseudogene, as amplification of 
sequences from the JBI gene would have resulted in a larger 
product spanning over three exons (exons 5 to 7) and two 
35 introns . Analysis of the Stanford G3 and Genebridge G4 

radiation hybrid panels confirmed the observation done by 
FISH that the JBI pseudogene is localized on the long arm 
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of chromosome 17, which corresponds to 17q21. 32-33 on the 
LDB (59) cytogenic map. The reason why the location of the 
IBl gene was not detected using this pair of 
oligonucleotides is not known. To locate the position of 
5 the IBl gene, an additional pair of primers were used. One 

primer was designed based on the sequence of IBl cDNA and 
the other one on intronic sequences within the IBl gene. 
Results from the two radiation hybrid panels placed the JBI 
gene on the short arm of chromosome 11, thus confirming the 
10 previous observation done by FISH, which corresponds to 
llpll.l2 on the LDB (59) cytogenic map. The TBI gene was 
localized adjacent to markers D11S1344 and D11S3979. 

Exanple 12 Expression of IBl in human tissues. 

Expression of IBl in human tissues was examined using 
Northern blot analysis of poly(a)+RNA isolated from a panel 
of different human tissues and the 1 . 8kb fragment 
corresponding to the 3 ' end of human IBl cDNA as a probe . 
IBl was expressed as a single transcript in the pancreas, 
the heart, brain and testis. The tissue distribution of 
IBl in human tissues closely resembles the distribution 
previously reported in rat tissues. 

Discussion 

25 These results show the isolation and characterization 

of a new transact ivator of the GLUT2 and the insulin genes. 
This factor, termed IBl, is highly expressed in pancreatic 
jS-cells and was isolated by its ability to bind to a cis 
regulatory element of the GLUT2 promoter. We also 

30 demonstrate that IBl is able to bind to a homologous 

regulatory element of the insulin gene, RIPE3. This 
enhancer sequence contains two separate sub- elements , 
RIPE3a and RIPE3b, which function cooperatively to generate 
maximal activity of this tissue-specific enhancer (Hwung et 

35 al., 1990) , RIPE3 binds ubiquitously expressed and /3-cell 

specific trans-acting factors such as BETAl, BETA2 and E47 
(Shieh and Tsai, 1991; Murre et al . , 1989; Peyton et al . , 



15 
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1994; Naya et al . , 1995). The BETA2/E47 heterodimer has 
been shown to be a potent transact ivat or of the RIPE3 
element, an effect mediated through the E box located 
within the RIPE3a subelement (Naya et al . , 1995). This 
stimulatory effect necessitates the RIPE3b adjacent sub- 
element where yet-to-be- identified /8-cell specific trans- 
acting factor (s) bind. As IBl is able to bind RIPE3 in 
vitro and as this factor is able to transactivate RIPE3 
mediated reporter gene expression in (3 cells, we propose 
that IBl is one of the important partners of the RIPE3- 
binding proteins. The competition assays using as probe 
either GTII or RIPE3 demonstrated that the GTII -binding 
proteins have a high affinity for the GTII cis element, 
however they are competed with an excess of cold RIPE3 
oligonucleotides. Conversely, the RIPE3 -binding proteins 
detected by mobility shift (EMSA) assay with the RIPE3 
probe are competed with an excess of cold GTII. However, 
we were unable to supershift, using anti-IBl antibodies, 
the GTII- or RIPE3 -binding activities detected by EMSA. 
This observation suggests that the antibodies raised 
against IBl are unable to detect IBl in non-denaturating 
conditions where the epitopes may be masked by other 
binding proteins. Furthermore, IBl binding activity 
necessitates post-translational modifications . 
Dephosphorylation of |8TC3 nuclear extracts abolishes the 
binding of IBl to GTII or RIPE3 when assessed by 
Southwestern analysis. 

Functionally, IBl is a transactivator of the GLUT2 
promoter (170% increase over basal) but a potent inducer of 
insulin gene transcription (380% increase over basal) . 
This effect may reflect the higher binding affinity of IBl 
to GTII rather than RIPE3 (see Figure 5) . As IBl functions 
only in iS-cell lines where endogenous IBl is present, one 
may speculate that the GLUT2 reporter construct cannot be 
stimulated with the expression vector encoding IBl since 
the high affinity GTII cis elements are already occupied by 
endogenous IBl. More recently, Stellrecht and co-authors 
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have elegantly demonstrated that multiple copies of the 
RIPE3 sequence are able to drive reporter gene expression 
in a restricted manner in pancreatic iS-cells and in the 
brain of transgenic mice (Stellrecht et al . , 1997) . As IBl 

5 expression is the highest in the brain and in jS-cells and 

as this factor transact ivates the insulin gene through 
RIPE3, these observations show that IBl participates in the 
tissue-specific control of the insulin gene. 

As observed with BETA2 , IBl is expressed in a highly 
10 restricted manner (Naya et al . , 1995). BETA2 is found in 

Qf-and /J-cells and in the brain whereas IBl expression is 
present only in jS-cells, in the brain and to a lower extent 
in the heart and the kidney. Interestingly, GLUT2 is 
expressed in /8- cells, in the gut, the kidney, the liver and 

15 in a subset of neurons (Thorens et al . , 1988; Leloup et 

al., 1994; Orci et al . , 1989) and the gene encoding GLIJT2 
is abnormally regulated only in the endocrine pancreas when 
diabetes is present (Johnson et al . , 1990; Ohneda et al . , 
1993; Orci et al . , 1990b; Orci et al . , 1990a; Thorens et 

20 al., 1992; Thorens et al . , 1990; Unger, 1991), As IBl is 

a transact ivator of the insulin and the GLUT2 genes and as 
IBl expression is mainly restricted to )3-cells and is not 
expressed in liver, abnormal expression or function of IBl 
could be responsible for the diabetic- and ^S-cell specific 

25 dysregulation of GLUT2 . Phosphorylation is necessary, as 

a post translational modification of IBl, to allow the 
binding of IBl to its recognition site. Some jS-cell 
functions, including phosphorylating activities, are 
altered during diabetes and therefore could induce a loss 

30 of IBl binding activity. 

The recent identification of mutations present in the 
transcription factors HlTF-lcy and HNF-4Q!, factors which are 
weak transact ivators of the insulin gene and which are also 
expressed in jS-cells, is of interest since they are 

35 responsible for the onset of two forms of diabetes, M0DY3 

and MODYl, respectively (Yamagata et al . , 1996b; Yamagata 
et al . , 1996a) . 
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An intriguing observation on IBl is its subcellular 
localization as it is located in, both the cytoplasmic and 
nuclear compartments. IBl contains a putative nuclear 
translocation signal (Dingwall and Laskey, 1991) and 
5 several lines of evidence suggest that indeed, IBl is a 

nuclear protein. First, IBl is a DNA-binding protein as 
this factor has been cloned based on its ability to bind to 
the GTII CIS element. Second, the Southwestern experiments 
could detect IBl in nuclear extracts of transfected COS 

10 cells with the expression vector encoding IBl. Third, 

immunodetectable nuclear staining is present by 
histochemistry in pancreatic islets as well as in j3TC3 
cells and Western blot analysis of these cells could also 
detect IBl in the cytoplasm and the nucleus. Finally, IBl 

15 is clearly a transact ivator of the GLUT2 and the insulin 

promoter linked to a reporter gene which implies that this 
factor functions as a transcription factor. The mechanisms 
responsible for the active translocation of IBl in the 
nucleus are not yet understood but could be of importance 

20 as IBl transacting functions may simply depend on the 

control of translocation. Several other similar 

observations are described where a DNA binding protein is 
sequestrated in the cytoplasm and translocated into the 
nucleus once proper stimulus is present. NFkB is 

25 sequestered in the cytoplasm by an interaction with 1k-B 

(Beg et al., 1992). Once phosphorylated, the complex is 
dissociated and NF«B is translocated in the nucleus and 
acts as a transactivator (Ghosh and Baltimore, 1990) . BZP 
has also recently been described as a zinc finger DNA- 

30 binding protein expressed in jS-cells (Franklin et al . , 

1994) . Serum deprivation caused BZP to remain cytoplasmic, 
whereas the adjunction of serum induces BZP translocation 
allowing its function as an inhibitor of gene 
transcription. The mechanism of sequestration of the cAMP 

35 responsive element binding protein (CREB) in the cytoplasm 

of germinal cells has also been described and involves the 
formation of a truncated CREB protein, lacking the nuclear 
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translocation signal, by the use of an alternatively 
spliced exon (Waeber and Habener, 1991; Waeber et al . , 
19 91) , The mechanisms which trigger the translocation of 
IBl into the nucleus might involve glucose, serum or any 
5 cellular component such as cAMP or ATP. This issue is 

important to resolve as the IBl transactivating function 
may be dependent upon proper regulated translocation of the 
protein into the nucleus. 

The above results demonstrate that IBl is a novel DNA- 

10 binding protein which is expressed in a highly restricted 

manner in |S-cells and in the brain, which functions, 
together with other unidentified jS-cell specific factors, 
as a transact ivator of the GLUT2 and the insulin gene 
through homologous DNA sequences. Due to the restricted 

15 expression of IBl, we propose that IBl is an important 

factor which confers /3-cell-specif icity to the insulin and 
GLUT2 genes. 

IBl/JIP-1 is a key regulator of insulin secretion in 
the rat and an inhibitor of the JNK-activation pathway in 

2 0 the mouse. We demonstrate in this study that IBl/JIP-l is 

also present in humans. Indeed, human IBl is highly 
similar to mouse and rat IBl, Furthermore, IBl is 
expressed in humans in the same tissues as was previously 
reported for rodents. The JBI gene maps to the long arm of 
25 chromosome 11 at position llpll.l2 and is multi-exonic . 

There is also a pseudogene which shares a high degree of *' 
homology with IBl cDNA and is located on chromosome 17. 
The results above also show that no other IBl -related; 
homologue is present in human genome suggesting that, if 

3 0 JIP-1 is present in humans, it represents an IBl isoforrr 

encoded by the IBl gene. 

Apart from its 47-residue insert, rat IBl was 
previously shown to be 97% identical to mouse JIP-1. This 
striking inter-species identity is further reinforced by 
35 the present observations which show a 94% identity between 

human and rat or mouse IBl. The highly conserved structure 
of IBl in mammals is evidence of ' the biological importance 
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of IBl. 

A sequence highly homologous to human IBl cDNA was 
identified in human genome and was mapped to chromosome 
17q21-22. Two observations support the conclusion that 
5 this sequence represents an IBI -pseudogene. First, despite 

the presence of two potential ORF's of 737 and 657 
nucleotides, respectively, this sequence contained numerous 
stop codons . Second, the data shows that these potential 
ORF's are not translated into proteins once transfected 

10 into COS-1 cells, even when placed in front of a strong CMV 

promoter, suggesting the lack of correct initiating sites. 
In addition, it is of interest that this sequence contains 
the 141-bp sequence which is missing in JIP-1. It is thus 
very \inlikely that JIP-1 is encoded by this sequence. 

15 Our data show that, apart from the IBl pseudogene, the 

presence of additional TBI- related gene in human genome is 
unlikely. In addition, we show that the 141-bp insert 
which distinguishes JIP-1 from IBl spans over two exons. 
Taken together, these observations raise the questions as 

20 to whether JIP-1 is expressed in human tissues, and if so, 

what is the mechanism responsible for the generation of the 
transcript. The clone containing the 3' end of human IBl 
cDNA was obtained from a human insulinoma cDNA library and 
contained the insert. RT-PCR amplifications of this 

25 library was performed using oligonucleotides flanking the 

insert, and only one amplicon containing the insert was 
obtained. While it might be the case that JIP-1 is 
expressed in other tissues or during embryonic development, 
JIP-1 does not appear to be a "simple" spliced variant of 

3 0 the IBl gene. Potential splice donor/acceptor sites are 

present in exons 8 and 9. However, splicing at these sites 
would result in a frameshift and a truncated protein. 



wo 98/44106 



PCT/GB98/00972 



76 

Table 1 

Sequence of the Oligonucleotides Used to Characterize the 
Human IBl Gene. 



Name of the Oligonucleotides 

01 igo-dT- anchor primer 

PGR anchor primer 

RACE-1 

RACE -2 

RACE -3 

RACE- 4 

RACE- 5 

RACE- 6 

IB1-787F 

IB1-1326R 

IB1-1346F 

IB1-1602R 

hIBl-intrF : 

hlBl-924R : 



5' to 3^ Secaience 
GACCACGCGTATCGATGTCGAC (T) ^^V 
GACCACGCGTATCGATGTCGAC 
ATCAGGTCCATCTGCAGCATCTC 
GCAAGGGCTGTCGGTGGAGGTGCCTCGA 
TCCATCTGCATCTCGGCCT 
GGACAGGGTGTCTTTGC 
GGGCAGCTCATCGCTCAGGCA 
GTTCATGCGGTGGTGTCTGCT 
CATCGAGACCGAATCCAC 
GTTCAGGAATTTCTTGGAGAAATG 
TGATGAACCCGACGTCCAT 
ACCAGTGCGCATGTTGTAGG 
CCTGCCTTCATGACCTGCCTG 
GGATTCGGTCTCGATGCGAG 
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Claims : 

1. A substance which is an isolated polypeptide 
comprising a polypeptide having the amino acid sequence set 
out in figure lA. 

5 

2. A substance which is an isolated polypeptide 
comprising a polypeptide having the amino acid sequence set 
out in figure IF, 

10 3, A substance which is an isolated polypeptide having 

greater than 80% amino acid sequence identity with the 
amino acid sequence set out in figure lA or IF. 

4. The substance of claim 3 which is a polypeptide having 
15 greater than 95% sequence identity with the amino acid 

sequence set out in figure lA or IF. 

5. The substance of claim 3 or claim 4 which is a 
polypeptide having greater than 98% sequence identity with 

20 the amino acid sequence set out in figure lA or IF. 

6. A substance which is an isolated polypeptide which is 
a mutant, variant, derivative or allele of the polypeptide 
of any one of claims 1 to 5 . 

25 

7. A substance which is a fragment of a polypeptide 
having the amino acid sequence set out in figure lA or IF 
which exhibits a biological property of full length IBl 
protein. 

30 

8. The substance of claim 8 wherein the biological 
activity is the binding of the polypeptide to GTII nucleic 
acid- 

35 9. A substance which is a fragment of a polypeptide 

having the amino acid sequence in figure lA or IF, the 
fragment including the domain from amino acids 566-612 of 
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the sequence shown in figure lA or the domain from amino 
acids 563-509 of the sequence shown in figure IF, or an 
active portion of either of the domains. 

5 10. The substance of any one of claim 7 to 9 wherein the 

fragment is at least 10 amino acids in length. 

11. The substance of any one of claim 7 to 10 wherein the 
fragment is at least 2 0 amino acids in length. 

10 

12 . An isolated nucleic acid molecule encoding a 
polypeptide of any one of claims 1 to 11. 

13. An isolated nucleic acid molecule having the nucleic 
15 acid sequence set out in figure lA. 

14. An isolated nucleic acid molecule having the nucleic 
acid sequence set out in figure IE. 

20 15. An isolated nucleic acid molecule having greater than 

90% sequence homology to the nucleic acid sequence of 
figure lA or IE. 

16 . An expression vector comprising a nucleic acid 
25 molecule of any one of claims 12 to 15, operably linked to 

control sequences to direct its expression, 

17. Host cells transformed with an expression vector of 
claim 16. 

30 

18. A method of producing IBl polypeptide comprising 
culturing the host cells of claim 17 and isolating the 
polypeptide thus produced. 



35 



19. A cell line for transplantation into a patient, the 
cell line being transformed with nucleic acid encoding an 
IBl polypeptide of any one of claims 1 to 11, the cell line 
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being capable of producing functional IBl polypeptide. 

20. The cell line of claim 19 which is encapsulated by a 
biocompatible polymer coating, so that the IBl polypeptide 

5 produced by the cell line can be released through the 
polymeric coating. 

21. A pharmaceutical composition comprising one or more 
IBl polypeptides of any one of claims 1 to 11. 

10 

22. An IBl polypeptide of any one of claims 1 to 11 for 
use in a method of medical treatment . 

23 . An IBl nucleic acid molecule of any one of claims 12 
15 to 15 for use in a method of medical treatment. 

24. Use of an IBl polypeptide of any one of claim 1 to 11 
in the preparation of medicament for the treatment of 
diabetes . 

20 

25. Use of an IBl antagonist in the preparation of a 
medicament for the treatment of a neurological disorder, 
wherein the IBl antagonist inhibits an anti-apoptotic 
activity of IBl polypeptide. 

25 

26. The use of claim 25 wherein the neurological disorder 
is a neuroblastoma or glioblastoma. 

27. Use of an IBl polypeptide of any one of claims 1 to 11 
30 in the preparation of a medicament for the treatment of a 

neurodegenerative disorder. 

28. Use of an IBl antagonist in the preparation of a 
medicament for the treatment of cancer wherein the 

35 medicament promotes apoptosis in cancer cells. 

29. Use of an IBl polypeptide in the preparation of a 
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medicament for the treatment of cancer, wherein the 
medicament helps to protect healthy cells surrounding 
cancer cells. 

5 30, Use of an IBl polypeptide of any one of claims 1 to 11 

in the preparation of a medicament for inhibiting cells 
from undergoing apoptosis. 

31. Use of an IBl antagonist in the preparation of a 
10 medicament for the promoting apoptosis in cells. 

32. An antibody which is capable of specifically binding 
to an IBl polypeptide of any one of claims 1 to 11. 

15 33. Use of an IBl polypeptide of any one of claim 1 to 11 

in a method of screening for test siibstances having an IBl 
biological activity . 

34. The use of claim 33 wherein the IBl polypeptide 
20 includes the domain from amino acids 566-612 of the 

sequence shown in figure lA or the domain from amino acids 
563-609 of the sequence shown in figure IF, or an active 
portion of that domain. 

25 35. Use of an IBl polypeptide of any one of claims 1 to 11 

in a method of screening for substances which are specific 
binding partners of the IBl polypeptide. 

36. A method of screening for substances which mimic an 
30 activity of an IBl polypeptide of any one of claims 1 to 

11, the method comprising contacting a test substance with 
an IBl specific binding partner, determining whether the 
test substance binds to the specific binding partner. 

35 37. The method of claim 36 comprising the further step of 

determining whether a test substance binding to the 
specific binding partner has an IBl biological activity. 
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38. A method of screening for substances which affect or 
modulate a biological activity of an IBl polypeptide of any 
one of claims 1 to 11, the method comprising contacting one 
or more test substances with an IBl polypeptide in a 
reaction medium, testing the activity of treated IBl 
polypeptide and comparing that activity with the activity 
of the IBl polypeptide in a comparable reaction medium 
without the test substance or substances, 

39. A method for screening test substances for a 
biological activity of an IBl polypeptide of any one of 
claims 1 to 11, the method errploying a cell line which does 
not express IBl polypeptide, the cell line being 
transformed with an expression vector comprising a reporter 
gene under the specific control of GTII promoter, wherein 
the method comprises exposing cell line to a test substance 
and determining whether expression of the reporter gene 
occurs . 

40. Use of a nucleic acid sequence of any one of claim 12 
to 15 to screen for binding partners of the nucleic acid 
sequence . 

41. Use of a nucleic acid sequence of any one of claims 12 
to 15 in the design of primers for use in the polymerase 
chain reaction. 

42. A method for determining the presence of IBl nucleic 
acid and/or mutations within an IBl nucleic acid sequence 
in a test sample, the method comprising detecting the 
hybridisation of test sample nucleic acid to a nucleic acid 
probe based on the IBl nucleic acid sequences set out in 
figures lA or IE. 

43. A method for determining the presence of IBl nucleic 
acid and/or mutations within an IBl nucleic acid sequence 
in a test sample, the method comprising: 



10 



15 
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(a) comparing the sequence of nucleic acid in the 
sample with the IBl nucleic acid, sequence to determine 
whether the sarrple from the patient contains mutations; or, 

(b) determining the presence in a sample from a 
patient of the polypeptide encoded by the IBl gene and, if 
present, determining whether the polypeptide is full 
length, and/or is mutated, and/or is expressed at the 
normal level; or, 

(c) using DNA fingerprinting to compare the 
restriction pattern produced when a restriction enzyme cuts 
a sample of nucleic acid from the patient with the 
restriction pattern obtained from normal IBl gene or from 
known mutations thereof; or, 

(d) using a specific binding member capable of 
binding to a IBl nucleic acid sequence (either a normal 
sequence or a known mutated sequence) , the specific binding 
member comprising nucleic acid hybridisable with the IBl 
sequence, or substances comprising an antibody domain with 
specificity for a native or mutated IBl nucleic acid 
sequence or the polypeptide encoded by it, the specific 
binding member being labelled so that binding of the 
specific binding member to its binding partner is 
detectable; or, 

(e) using PGR involving one or more primers based on 
normal or mutated IBl gene sequence to screen for normal or 
mutant IBl gene in a sample from a patient. 
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Figure ID (i) 
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Figure ID (ii) 

hIB-1 643 NISFCGYHPKNNKYFGFITKHPADHRFACHVFVSEDSTKAIiAESVGRAFQ 692 

MlllllllillllllllllllilllllillMIIIIIIIIMIIIIIM 

rIBl 646 NISFCGYHPKNNKYFGFITKHPADHRFACHVFVSEDSTKALAESVGRAFQ 695 

hIB-1 693 QFYKQFVEYTCPTEDIYLE 711 

IIIIIIIIIIIMMIIil 
rIBl 696 QFYKQFVEYTCPTEDIYLE 714 
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Figure IE (i) 

1 ATGGCGGAGC GAGAAAGCGG CGGCCTGGGA GGGGGGGCCG CGTCCCCGCC 

51 CGCCGCCTCC CCGTTCCTGG GGCTGCACAT CGCTTCGCCT CCCAATTTCA 

101 G«GCTCACCCA TGACATCAGC CTGGAGGAGT TTGAGGATGA AGACCTCTCG 

151 GAGATCACTG ATGAGTGTGG CATCAGCTTA CAGTGCAAAG ACACCCTGTC 

201 CTTACGG«CCC CCGCGCGCCG GGCTGCTCTC TGCGGGCGGC GGCGGCGCGG 

251 GGAGCCGGTT GCAGGCCGAG ATGCTGCAGA TGGACCTGAT CGACGCGACG 

301 GGGGACACTC CCGGGGCCGA GGACGACGAG GAGGACGACG ACGAGGAGCG 

3 51 CGCGGCCCGG CGGCCGGGAG CGGGGCCGCC CAAGGCCGAG TCCGGCCAGG 

4 01 AGCCGGCGTC CCGCGGCCAG GGCCAGAGCC AAGGCCAGAG CCAGGGCCCG 
4 51 GGCAGCGGGG ACACGTACCG GCCCAAGCGG CCCACCACGC TCAACCTCTT 
501 TCCGCAGGTG CCGCGGTCTC AG«GACACACT GAATAATAAT TCTCTGGGCA 
551 AAAAGCACAG TTGGCAGGAT CGGGTGTCTC GATCATCCTC ACCCCTGAAG 
601 ACAG^GGGAGC AGACACCACC GCATGAACAC ATCTGCCTGA GCGATGAGCT 
651 GCCCCCCCAG AGCGGCCCCG CCCCCACCAC AGATCGAGGC ACCTCCACCG 
701 ACAGCCCTTG CCGCCGCAGC ACAGCCACCC AGATGGCACC TCCGGGTGGT 
751 CCCCCTGCTG CCCCGCCTGG GGGTCGGGGC CACTCGCATC GAGACCGAAT 
801 CCACTACCAG GCCGATGTGC GACTAGAGGC CACTGAGGAG ATCTACCTGA 
851 CCCCAGTGCA GAGGCCCCCA GACGCTGCAG AGCCCACCTC CGCCTTCCTG 
901 CCGCCCACTG AGAGCCGGAT GTCAGTCAGC TCCGATCCAG ACCCTGCCGC 
951 CTACCCCTCC ACGGCAGGGC GGCCGCACCC CTCCATCAGT GAAGAGGAAG 

10 01 AGGGCTTCGA CTGCCTGTCG TCCCCAGAGC GGGCTGAGCC CCCAGGCGGA 

1051 GGGTGGCGGG GGAGCCTGGG GGAGCCGCCG CCACCTCCAC GGGCCTCTCT 

1101 GAGCTCGGAC ACCAGCGCCC TGTCCTATGA CTCTGTCAAG TACACGCTGG 

1151 TGGTAGATGA GCATGCACAG CTGGAGCTGG TGAGCCTGCG GCCGTGCTTC 

1201 GGAGACTACA GTGACGAGAG TGACTCTGCC ACCGTCTATG ACAACTGTGC 

12 51 CTCCGTCTCC TCGCCCTATG AGTCGGCCAT CGGAGAGGAA TATGAGGAGG 

1301 CCCCGCGGCC CCAGCCCCCT GCCTGCCTCT CCGAGGACTC CACGCCTGAT 
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Figure IE (ii) 

1351 GAACCCGACG TCCATTTCTG CAAGAAATTC CTGAACGTCT TCATGAGTGG 
1401 CCGCTCCCGC TCCTCCA«GTG CTGAGTCCTT CGGGCTGTTC TCCTGCATCA 
1451 TCAACGGGGA GGAGCAGGAG CAGACCCACC GGGCCATATT CAG«GTTTGTG 
1501 CCTCGACACG AAGACGAACT TGAGCTGGAA GTG6ATGACC CTCTGCTAGT 
1551 GGAGCTCCAG GCTGAAGACT ACTGGTACGA GGCCTACAAC ATGCGCACTG 
1601 GTGCCCGGGG TGTCTTTCCT GCCTATTACG CCATCGAGGT CACCAAGGAG 
1651 CCCGAGCACA TGGCAG»CCCT GGCCAAAAAC AGTGACTGGG TGGACCAGTT 
1701 CCGGGTGAAG TTCCTG6GCT CAGTCCAGGT TCCCTATCAC AAGGGCAATG 
1751 ACGTCCTCTG TGCTGCTATG CAAAAG»ATTG CCACCACCCG CCGGCTCACC 
1801 GTGCACTTTA ACCCGCCCTC CAGCTGTGTC CTGGAGATCA GCGTGCGGGG 
1851 TGTGAAGATA GGCGTCAAGG CCGATGACTC CCAGGAGGCC AAG»GGGAATA 
1901 AATGTAGCCA CTTTTTCCAG TTAAAAAACA TCTCTTTCTG CGGATATCAT 
1951 CCAAAGAACA ACAA«GTACTT TGGGTTCATC ACCAAGCACC CCGCCGACCA 
2001 CCGGTTTGCC TGCCACGTCT TTGTGTCTGA AGACTCCACC AAAGCCCTGG 
2051 CAGAGTCCGT GGG«GAGAGCA TTCCAGCAGT TCTACAAGCA GTTTGTGGAG 
2101 TACACCTGCC CCACAGAAGA TATCTACCTG GAGTAG 
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Amino acids sequence of human IBl 

1 MAERESGGLG GGAASPPAAS PFLGLHIASP PNFRLTHDIS L2EFEDEDLS 

51 EITDECGISL QCKDTLSLRP PRAGLLSAGG GGAGSRLQAS MLQMDLIDAT 

101 GDTPGAEDDE EDDDEERAAR RPGAGPPKAE SGQEPASRGQ GQSQGQSQGP 

151 GSGDTYRPKR PTTLNLFPQV PRSQDTLNNN SLGKKHSWQD RVSRSSSPLK 

2 01 TGEQTPPHEH ICLSppLPPQ SGPAPTTDRG TSTDSPCRRS TATQMAPPGG 
251 PPAAPPGGRG HSHRDRIHYQ ADVRLEATEE INLTPVQRPP DAAEPTSAFL 

3 01 PPTESRMSVS SDPDPAAYPS TAGRPHPSIS EEEEGFDCLS SPERAEPPGG 
3 51 GWRGSLGEPP PPPRASLSSD TSALSYDSVK YTLWDEHAQ LELVSLRPCF 
401 GDYSDESDSA TVYDNCASVS SPYESAIGEE YEEAPRPQ?? ACLSEDSTPD 
451 EPDVHFSKKF LNVFMSGRSR SSSAESFGLF SCIINGEEQE QTKRAIFRFV 
501 PRHEDELELE VDDPLLVELQ AEDYWYEAYN MRTGARGVFP AYYAIEVTKE 
551 PEHMAALAKN SDWVDQFRVK FLGSVQVPYH KGNDVLCAAM QKIATTRRLT 
601 VHFNPPSSCV LEISVRGVKI GVBCADDSQEA KGKKCSHFFQ LKWISFCGYH 
651. PKNNBTYFGFI TKHPADHRFA CHVFVSEDST KAU^ESVGRA FQQFYKQFVE 
701 YTCPTEDIYL E* 
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